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ABSTRACT 

We detect the large-scale structure of Lya emission in the Universe at redshifts z = 
2 — 3.5 by measuring the cross-correlation of Lya surface brightness with quasars in 
the Sloan Digital Sky Survey (SDSS/BOSS). We use nearly a million spectra targeting 
Luminous Red Galaxies (LRGs) at z < 0.8, after subtracting a best fit model galaxy 
spectrum from each one, as an estimate of the high-redshift Lya surface brightness. 
The quasar-Lya emission cross-correlation we detect has a shape consistent with a 
linear ACDM model with = O.SOffhJ. The predicted amplitude of this cross¬ 
correlation is proportional to the product of the mean Lya surface brightness, (ft a ), 
the amplitude of mass density fluctuations, and the quasar and Lya emission bias 
factors. Using published cosmological observations to constrain the amplitude of mass 
fluctuations and the quasar bias factor, we infer the value of the product (fi a ) (b a / 3) = 
(3.9±0.9) x 10 -21 ergs -1 cm -2 A -1 arcsec 2 , where b a is the Lya emission linear bias 
factor. If the dominant sources of Lya emission we measure are star forming galaxies, 
we infer a total mean star formation rate density of psfr = (0.28 ± 0.07)(3/6 a ) yr -1 
Mpc -3 at z = 2 — 3.5. For b a = 3, this value is a factor of 21 — 35 above previous 
estimates relying on individually detected Lya emitters, although it is consistent with 
the total star-formation density derived from dust-corrected, continuum UV surveys. 
Our observations therefore imply that 97% of the Lya emission in the Universe at 
these redshifts is undetected in previous surveys of Lya emitters. Our detected Lya 
emission is also much greater, by at least an order of magnitude, than that measured 
from stacking analyses of faint halos surrounding previously detected Lya emitters, but 
we speculate that it arises from similar low surface brightness Lya halos surrounding 
all luminous star-forming galaxies. We also detect a redshift space anisotropy of the 
quasar-Lya emission cross-correlation, finding evidence at the 3.0cr level that it is 
radially elongated, contrary to the prediction for linear gravitational evolution, but 
consistent with distortions caused by radiative-transfer effects, as predicted by Zheng 
et al. (2011). Our measurements represent the first application of the intensity mapping 
technique to optical observations. 
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1 INTRODUCTION 

The Lya emission line of neutral hydrogen is a strong fea¬ 
ture that has been used to detect galaxies at a wide range 
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of redshifts (e.g., Hu & McMahon 1996, Keel et al. 1999, 
Fujita et al. 2003, Cowie et al. 2010). Another potentially 
useful technique is that of intensity mapping (e.g., Carilli 
2011, Peterson & Suarez 2012, Pullen et al. 2014), which 
seeks to map the large-scale structure using one emission 
line or more (see e.g., Wyithe & Morales 2007, Visbal & 
Loeb 2010), without resolving individual sources (such as 
galaxies or gas clouds). By measuring this structure, one 
is sensitive to all clustered emission, without the observa¬ 
tional biases which arise from source detection and luminos¬ 
ity measurement (such as detection limits, determination of 
backgrounds and finite aperture size). In this paper, we seek 
to perform the first cosmological measurement of intensity 
mapping in the Lya line, using a large dataset of spectra 
from the Sloan Digital Sky Survey III (SDSS-III, Eisenstein 
et al. 2011) Baryon Oscillation Spectroscopic Survey (BOSS, 
Dawson et al. 2013). We use spectra that were targeted at 
massive galaxies at z < 0.8. After subtracting best fit model 
galaxy spectra, we expect that any high redshift Lya emit¬ 
ters that are within the fiber aperture result in a residual 
flux, present also in sky fibers. Even if not detectable as 
individual sources, we can search for large-scale structure 
in this emission by determining its spatial cross-correlation 
function with the positions of BOSS quasars, which are trac¬ 
ers of structure with a known bias factor (e.g., White et al. 
2012) at redshifts z > 2, where the Lya emission line is in 
the optical part of the spectrum. 

Following the early prediction by Partridge & Peebles 
(1967) that galaxies should be detectable at high redshift 
from their Lya emission line, many surveys have been de¬ 
signed to detect individual galaxies as sources of Lya emis¬ 
sion. These include narrow band imaging (e.g., Ouchi et al. 
2003, Gronwall et al. 2007), serendipitous slit spectroscopy 
(e.g., Cassata et al. 2011) and integral field spectroscopy 
(e.g., van Bruekelen et al. 2005, Blanc et al. 2011). These 
techniques have resulted in the compilation of catalogs of 
several hundred to a few thousand Lya emitting galaxies 
from redshifts z ~ 2.1 to the redshifts associated with the 
end of reionization. These samples have been used to show 
that Lya emitters of line luminosity L a = 10 42 ergs -1 found 
at 2 = 3 have space densities of ~ 10 -3 Mpc -3 (e.g., Gawiser 
et al. 2007, Cassata et al. 2015) and are therefore expected 
to be the progenitors of L* galaxies at redshift z = 0. The 
clustering of these galaxies has been measured on scales of 
up to 10 Mpc by Gauita et al. (2010) and Gawiser et al. 
(2007), who find that at redshifts 2 = 2 — 3 they have a 
bias factor with respect to the underlying matter (in CDM 
models) of b ~ 1.5 — 2. Integrating the luminosity functions 
of Lya emitting galaxies, assuming a power-law extrapola¬ 
tion for the faint end slope, has revealed that the comoving 
volume emissivity of Lya photons declines significantly from 
z ~ 6 to z ~ 2 (Cassata et al. 2011, Gronwall et al. 2007, 
Ouchi et al. 2008). This behaviour can be compared to the 
opposite evolution in redshift of galaxies measured in opti¬ 
cally thin par ts of the rest-fram e UV spectrum (or using the 
Ha line, e.g.. lHaves ct al. 20l l]). This co mpa rison has been 
used to infer (e.g., by Haves et al. !lm| and ICassata et al.l 
1201 ill that the escape fraction of Lya photons produced in 
star forming regions has significantly decreased from z = 6 
to 2 = 2. 

Because Lya photons have a high cross-section for scat¬ 
tering off neutral hydrogen, extended Lya emission is ex¬ 


pected to be common in many environments. For example, 
Lya radiation from star forming regions in galaxies should 
undergo hundreds or thousands of scatterings in gas in any 
circumgalactic medium before finally escaping or else be¬ 
ing absorbed by dust. The existence of a general fluorescent 
emission from the intergalactic medium was also hypothe¬ 
sised by Hogan & Weymann (1987) and Gould & Weinberg 
(1996). Theoretical work applying line radiative transfer on 
gas distribution in cosmological hydrodynamic simulations 
has made predictions for Lya emission around galaxies and 
quasars (e.g., Cantalupo et al. 2005, Laursen et al. 2007, 
Kollmeier et al. 2010, Zheng et al. 2011a), as well as metal 
line emission (Bertone & Schaye 2012). These studies have 
resulted in predictions of extended Lya halos around galax¬ 
ies with sizes of hundreds of kpc, with a strong dependence 
of their properties on environment that can lead to new ef¬ 
fects on galaxy clustering (Zheng et al. 2011a). 

Observational evidence for extended emission includes 
the discovery and characterization of the so-called Lya 
“blobs” (Steidel et al. 2000). Deep spectroscopic searches 
for diffuse Lya emission have been completed by Rauch 
et al. (2008), finding faint Lya emitting galaxies. Stacking 
of spectra of damped Lya absorbers in quasars has also pro¬ 
duced measurements of residual Lya emission (Rahmani et 
al. 2010, Noterdaeme et al. 2014). Recently, Martin et al. 
(2014a,b) published the first results from the Cosmic Web 
Imager, an integral field spectrograph designed to map low 
surface brightness emission, detecting Lya emission from fil¬ 
amentary structures around a 2 = 2.8 quasar as long as 
250-400 proper kpc. Diffuse Lya halos around high redshift 
galaxies have been found to be ubiquitous by Steidel et al. 
(2011) and Matsuda et al. (2012). Momose et al. (2014) have 
assembled several samples of up to 3600 Lya emitters from 
Subaru narrowband imaging at a range of redshifts from 
2 = 2.2 to 2 = 6.6 and, after controlling for atmospheric 
and instrumental artifacts, they detect diffuse extended Lya 
halos with exponential scale lengths of ~ 5 — 10 kpc from 
2 = 2.2 — 5.7. The large scale studies in our paper are an 
alternative, complementary observational strategy to these 
earlier studies, which involve deep integrations over small 
fields of view. 

All of these sources should be clustered on large scales 
and should contribute to the mean Lya emission inten¬ 
sity in the Universe. This mean emission is detectable if 
it cross-correlates as expected with other tracers of large- 
scale structure that we can observe at the same redshift. We 
shall use the quasars found by BOSS at 2 > 2 to correlate 
with Lya emission in this work. This clustered Lya emis¬ 
sion is extremely faint, but as we shall demonstrate it can 
be detected with BOSS thanks to the enormous number of 
spectra that are observed. While large-scale clustering mea¬ 
surements cannot easily allow separation of the signal into 
various sources, we may expect faint Lya emitting galaxies 
to dominate over quasars due to their much larger number 
density. If this is the case, then the mean Lya emission inten¬ 
sity clustered with quasars can be used as a measure of the 
global star formation rate (e.g., Cassata et al. 2011), times 
the mean (luminosity weighted) bias factor of the distribu¬ 
tion of these galaxies. Our measurement of Lya emission 
will therefore be useful as a probe of star formation which 
takes into account all sources of Lya emission, reaching to 
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arbitrarily faint luminosities and surface brightnesses from 
extended halos to faint galaxies. 

The structure of this paper is as follows. Section 2 de¬ 
scribes the data samples we use in our work, which include 
the galaxy and sky spectra from SDSS DR10, along with the 
quasar catalog from that data release. We present our mea¬ 
surement of quasar-Lya emission correlations in Section 3, 
including the evolution with redshift and clustering parallel 
and transverse to the line of sight. Section 4 describes our 
tests involving fitting and subtraction of emission lines. In 
Section 5, we convert our determination of the Lya surface 
brightness into a star formation rate density and compare 
to other measurements. In Section 6, we summarise our re¬ 
sults and in Section 7 discuss them further. There are also 
three appendices to the paper, A-C; in these we measure 
stray light contamination, determine a large-scale surface 
brightness correction, and perform some sample tests of our 
results. 


2 DATA SAMPLES 

This study makes use of data from the S DSS BOSS survey 
Data Release 10 ('DRIP. [Aim et akll2014l l. including quasar 
position and redshift data, galaxy spectra and sky fiber spec¬ 
tra. The SDSS camera and telescope are described in Gunn 
et al. (1998) and Gunn et al. (2006), respectively. Full infor¬ 
mation on the SDSS/BOSS spectrographs can be found in 
Smee et al. (2013). The wavelength coverage of the spectro¬ 
graph is from A =3560 A to 10400 A the resolving power is 
R ~ 1400 for the range A = 3800 A— 4900 A, and is kept 
above R = 1000 for the remainder of the wavelength range. 
The fibers have a diameter of 120 /im, corresponding to 2 
arcsec. in angle. We restrict the redshift range of data we use 
in our analysis to 2.0 < z < 3.5, due to the spectrograph 
cutoff at low redshift and the limited number of observed 
quasars at high redshift. 

2.1 Spectra 

The 987482 galaxy spectra in our sample are of targeted 
LRGs which are within redshifts z ~ 0.15 and 2 ~ 0.7. The 
redshift range of the original targets is not important to our 
study, as for each spectrum we make use only of the pixels for 
which the Lya emission line lies within the redshift range 
specified above ( 2.0 < 2 < 3.5). In observed wavelength 
units this is 3647 A to 5470 A. We also make use of 146065 
sky fiber spectra. 

The main BOSS LRG program consists of two galaxy 
target samples (see Dawson et al. 2013), designated CMASS 
(for constant mass) and LOWZ (for low-redshift). The 
LOWZ galaxy sample is composed of massive red galax¬ 
ies spanning the redshift range 0.15 ^ 2 ^ 0.4. The CMASS 
galaxy sample is composed of massive galaxies spanning the 
redshift range 0.4 A. 2 A. 0.7. Both samples are color-selected 
to provide near-uniform sampling over the combined volume. 
The faintest galaxies are at r = 19.5 for LOWZ and i = 19.9 
for CMASS. Colors and magnitudes for the galaxy selec¬ 
tion cuts are corrected for Galactic extinction using Schlegel 
et al. (1998) dust maps. We do not differentiate between 
CMASS and LOWZ samples in our analysis. 

The spectroscopic measurement pipeline for BOSS is 


described in detail in Bolton et al. (2012). The most im¬ 
portant data products that are used in the present anal¬ 
ysis are: (a) Wavelength-calibrated, sky-subtracted, flux- 
calibrated, and co-added object spectra, which have been 
rebinned onto a uniform baseline of A log 10 A = 10 -4 (about 
69 km s -1 pixel -1 ), (b) Statistical error-estimate vectors for 
each spectrum (expressed as inverse variance) incorporating 
contributions from photon noise, CCD read noise, and sky- 
subtraction error, (c) Mask vectors for each spectrum. 


2.1.1 Data preparation 

For each of the LRG spectra, we subtract the best fit model 
spectrum provided by the pipeline. This template model 
spectrum (see Bolton et al. 2012 for details) is computed 
using least-squares minimization comparison of each galaxy 
spectrum to a full range of galaxy templates. A range of red- 
shifts is explored, with trial redshifts spaced every pixel. At 
each redshift the spectrum is fit with an error-weighted least- 
squares linear combination of redshifted template eigenspec- 
tra in combination with a low-order polynomial. The polyno¬ 
mial terms absorb Galactic extinction, intrinsic extinction, 
and residual spectrophotometric calibration errors (typically 
at the 10% level) that are not fully spanned by the eigen- 
spectra; there are three polynomial degrees of freedom for 
galaxies. The template basis sets are derived from restframe 
principal-component analyses (PCA) of training samples of 
galaxies, and have four degrees of freedom (eigenspectra). 

After subtraction of the best fittting template spectrum, 
we compute the average residual spectrum of all galaxies. 
This is displayed in Figure [T] where the horizontal axis is 
labelled in units of the redshift of the Lya line. Figure[l]also 
presents the mean sky spectrum and the sky-subtracted sky 
spectrum. 

We can see that the residual surface brightness per unit 
wavelength (hereafter shortened to “surface brightness”- we 
use this term to refer to the quantity measured through¬ 
out the paper, which is most precisely the flux density per 
unit solid angle per unit wavelength) in the galaxy fibers 
is within ±10 -19 ergs -1 cm -2 A 1 arcsec -2 for most of the 
redshift range. There are however significant excursions cor¬ 
responding to features including the zero redshift Calcium 
H and K lines (at 3969 and 3934 A) and a strong Mercury 
G line from streetlamps (at 4358 A). In our analysis we sub¬ 
tract the mean residual surface brightness, from all spectra 
before cross-correlating them as we are only interested in 
the fluctuations in the Lya surface brightness. In order to 
reduce noise, we also mask two regions corresponding to 
large features in the residual surface brightness, 40 A and 
30 A windows centered on wavelengths 3900 A and 4357 A 
respectively (corresponding to redshifts 2 = 2.21 and 2.58). 

Comparing the sky fiber and galaxy fiber residual spec¬ 
tra, we can see that there are differences at the ~ 10 -19 
ergs -1 level over much of the spectra. We attribute these to 
galaxy surface brightness that was not subtracted perfectly 
by the galaxy model. In our cross-correlation technique for 
measuring clustering in the Lya emission we necessarily sub¬ 
tract the mean surface brightness, therefore residual fluctu¬ 
ations seen in Figure [T] are not problematic except for the 
noise they contribute. 
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Figure 1. Black line: the average residual surface brightness in 
all 987482 LRG spectra after subtraction of the best fit galaxy 
model. The bottom horizontal axis is in units of the redshift of 
the Lya line and the top in units of observed wavelength. Red line 
(top): the average sky fiber surface brightness in all 146065 sky 
fiber spectra. Cyan line: the average sky fiber surface brightness 
with model sky subtracted from all sky fiber spectra. In all curves, 
the prominent emission line at wavelength A = 4358Ais due to 
terrestrial airglow from Mercury streetlamps (the Hg G-line). 

2.2 Quasars 

We use quasars from the SDSS/BOSS DR10 catalogue (Ahn 
et al. 2014). The quasar target sample included both color- 
selected candidates and known quasars (Bovy et al. 2011; 
Kirkpatrick et al. 2011; Ross et al. 2012). The candidate 
quasar spectra were all visually inspected and redshift esti¬ 
mates computed using a principal component analysis (see 
Paris et al. 2012 for the details of the procedure as applied 
to DR9 quasars). We select the 130812 quasars in the DR10 
dataset that have redshifts in the range 2.0 < z < 3.5. Be¬ 
cause the galaxy pixels cover this redshift range uniformly, 
the central redshift of our measurements is the mean redshift 
of these quasars, z = 2.55. 


3 QUASAR-Lya EMISSION 

CROSS-CORRELATION 

Before computing the quasar-Lya emission cross¬ 
correlation, we first split the sample of galaxy spectra 
into 100 subsamples of approximately equal sky area based 
on contiguous groupings of plates. We then convert the 
galaxy spectrum pixels and the quasar angular positions 
and redshifts into comoving Cartesian coordinates using a 
flat cosmological model with matter density D m = 0.315, 
consistent with the Planck, Ade (2014) results (cosmological 
constant density Da = 0.685). This fiducial model is used 
throughout the paper. 

We compute the quasar-Lya emission surface brightness 
cross-correlation, £ qa (r) , using a sum over all quasar-galaxy 


spectrum pixel pairs separated by r within a certain bin: 

1 Af(r) 

£qa( r ) = N (r) ^2 w ri (1) 

Z^i= 1 W ri i =1 

where N(r) is the number of pixels in the bin centered on 
quasar-pixel distance r, and = fi r i — (fj,(z)) is the resid¬ 
ual surface brightness in the spectrum at pixel i for the bin 
r. Note here that we have a different list of pixels labeled 
as i for each bin in the separation r between a pixel and a 
quasar, which has Lya surface brightness The residual 
flux at each pixel is obtained by subtracting the mean at 
each redshift, ( fj,(z )). We weight each pixel by w r i = l/of, 
where of is the pipeline estimate of the inverse variance of 
the flux at each pixel. We first present our results as a func¬ 
tion of only the modulus of the quasar-pixel separation r 
in comoving /i _1 Mpc, in 20 bins logarithmically spaced be¬ 
tween r = 0.5/i _1 Mpc and r = 150/i _1 Mpc. In Section [3.31 
we will also examine redshift space anisotropies in the cor¬ 
relation function £ qct by considering bins in the parallel and 
perpendicular components of r, using the same formulation. 

When evaluating equation [T] a possible signficant sys¬ 
tematic error is caused by stray light from the quasars them¬ 
selves contaminating spectra of nearby galaxies. This occurs 
because the light from the various fibers is dispersed onto a 
single CCD, so that extraction of each spectrum along one 
dimension (Bolton et al. 2012) may include light from adja¬ 
cent fibers of bright sources. We see strong evidence of this 
stray light from quasars in galaxy spectra when the quasar 
and galaxy spectra are four fibers apart or fewer, in the 
list of fibers as they are ordered in the CCD. The effect is 
discussed in detail in Appendix A. When the galaxy and 
quasar spectra in a pair are more than 4 fibers apart, we 
see no evidence for this contamination, and the results are 
statistically consistent with using only pairs of quasars and 
galaxies on different plates (see Appendix A). In order to 
safely eliminate this stray quasar light when computing the 
flux cross-correlation with equation [l] we therefore apply 
the constraint that the quasar and galaxy fibers must be at 
least six fibers apart. 

There is also the possibility that some clustering in the 
plane of the sky is generated by effects (e.g., galactic ob¬ 
scuration) which modulate both Lya surface brightness and 
quasar target selection. Appendix B presents measurements 
of ( qo for quasar-pixel pairs which are close together on 
the sky (i.e., in the transverse separation) but widely sep¬ 
arated along the line of sight. This measurement enables 
us to quantify how much clustering could be caused by ef¬ 
fects such as Galactic obscuration and to compute a £ qct 
correction term to be subtracted from our fiducial cluster¬ 
ing result. We also measure £ qa for pairs which are close 
in the line-of-sight separation but widely separated on the 
sky. This latter measurement constrains how much spurious 
clustering is caused by large-scale variations in the line-of- 
sight direction, for example redshift evolution in the effi¬ 
ciency of galaxy subtraction, or flux calibration errors with 
wavelength that may be associated with sky lines. We apply 
the corrections to £ qa from Appendix B to our analysis be¬ 
low and in the other sections of the paper. We discuss the 
small-scale anisotropy of £ qa in Section ECU below. For now 
we note that application of the correction factors described 
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above changes the amplitude and shape of our measured £ qa 
by less than lcr in all cases. 

We perform the pairwise computation of equation |T| for 
each of our 100 subsamples, and then compute the mean 
and standard deviation of £ qa (r) using a Jackknife estima¬ 
tor. The Jackknife estimator is also used to compute the 
covariance matrix of £ q a (r) : 

M 

Cii = y J ^qrv./r (y 7 .) — ^ q o:(r,)] [£ qQ!j fc {rj ) — Cqa( r j)]i (2) 

fc =1 

where £q a ,k(ri) is the cross-correlation in bin i for Jackknife 
sample k, £ qct (r;) is the cross-correlation for bin i for the full 
dataset, and the number of Jackknife samples is M = 100. 

3.1 Fiducial result 

We show £ qa (r) for our fiducial sample (which is the en¬ 
tire dataset over the redshift range 2.0 < z < 3.5) in Figure 
[2] The mean Lya redshift of the galaxy pixels in this sam¬ 
ple is 2 = 2.71 and of the quasars z = 2.55. Because the 
galaxy pixel distribution is uniform in redshift, and because 
quasar-pixel pairs with small separations contribute most 
to the clustering signal, we adopt the effective mean red¬ 
shift of our fiducial measurement to be 2 = 2.55. The cross¬ 
correlation function is in units of the surface brightness of 
Lya emission, and its amplitude is directly proportional to 
that surface brightness. The £ qQ ,(r) points reveal that there 
is significant measurable large-scale structure present in the 
Lya emission, on scales from 1 to ~ 15/i _1 Mpc. Figure[2]also 
displays a linear CDM fit to the cross-correlation function, 
which is consistent with observational results on scales from 
1 — 100/i -1 Mpc. We turn to this fit in the next subsection. 

3.2 Model fit 

If the Lya emission clustering is due to a linearly biased 
version of the density field, then a model for the isotropically 
averaged quasar-Lya cross-correlation £ qa (r) is as follows: 

£ q a(r) = b q b a fp(p a )£(r) (3) 

where (p a ) is the mean surface brightness of Lya emission, 
b q and b a are the quasar and Lya emission linear bias fac¬ 
tors, and £(r) is the linear ACDM correlation function. 

It is important to note that b a is the bias factor for Lya 
surface brightness fluctuations, and is different in definition 
from the usually quo ted bias factor of Ly a e mitters, 6lae 
(e.g.. as measured bv lGawiser et al1l2007l and iGuaita et all 
(2010). The bias factor 6lae reflects the relation between the 
fluctuations 5 n in the number density n of Lya emitters and 
that in the matter density 5, 

5 n = n ~ = b LAE J, (4) 

W 

where 5 = (p — (p))/{p) and p is the matter density field. 
The factor b a in Equation [3] relates fluctuations 8 M in the 
Lya surface brightness p to matter fluctuations according 
to 

= ba 5. (5) 

M 

In the absence of radiative transfer effect (|Zheng et al.1 



1 10 100 
r (h -1 Mpc) 


Figure 2. The quasar-Lya emission cross-correlation function, 
£q a(r) (see Equation [TJ. The points represent results for the 
fiducial sample that covers redshift range 2.0 < 2 : < 3.5. The 
error bars have been calculated using a jackknife estimator and 
100 subsamples of the data. The smooth curve is a best fit linear 
CDM correlation function (see Section 13. 2 The top panel shows 
the £qa(r) results with a log y-axis scale, and the bottom panel 
displays r£q a (r) on a linear scale in order to allow points which 
are negative to be visible. 

1201 lal : see Section [6]4]), the Lya surface brightness p is pro¬ 
portional to the Lya luminosity density pL of the underlying 
star-forming galaxy population. The fluctuations of the 
latter can be characterized by the bias factor bL, 

5 l = pL y ^ = b L S, (6) 

\PL ) 

and we have b a = bL- As bL reflects weighting by luminos¬ 
ity rather than by number, it is likely to be significantly 
higher than 6 lae, because higher luminosity emitters tend 
to be more strongly clustered. We will return to this topic 
in Section [5] Radiative transfer effect leads to a modifica¬ 
tion in the relation b a = bL- In a simple model, we have 
ba = b L + ai with ai a positive number (see Section [6.411 . 
Overall, we expect b a to be substantially higher than 6 lae- 
In Equation [3] /a is a constant enhancement to the cor¬ 
relation function on linear scales of the form that is caused 
by peculiar velocity redshift-space distortions (Kaiser 1987). 
We use the linear CDM transfer function of Lewis et al. 
(2000) to compute £( r )- In our computations we choose to 
vary the shape of the correlation function by changing fl m , 
the matter density, in the context of the ACDM model, keep- 
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ing the other parameters which influence the shape (such as 
h, and the baryon density fib) fixed. Parameters have been 
reported for the best fit ACDM model to the Planck Satel¬ 
lite data by Ade et al. 2014. We assume Planck values for 
fib = 0.049 and the spectral index n s = 0.9603 but set 
h — 0.7. Note that we are merely using fl m to parametrize 
the shape of the correlation function to see if it is consistent 
with other observations, and are not presenting our results 
for fl m as properly marginalized measurements of that pa¬ 
rameter. 

The other free parameter is the amplitude, b q b a fp{n). 
We assume in all cases that the underlying amplitude of 
mass fluctuations as(z = 0) = 0.83, and therefore that 
as(z = 2.55) = 0.294, again consistent with Ade et al. 2014. 

We fit our model to the data in Figure [5] by varying 
these two parameters. The \ 2 value is given by 

x 2 = EiC(^) - c- c°Vt)i (7) 

N 

where the sum is over the N = 20 bins, £qa S (ri) is the ob¬ 
served cross-correlation measured in bin i, £^ d (r;) is the 
model prediction for bin i, and Cij is the covariance matrix 
computed using our 100 Jackknife samples in Equation [2] 

The best fit values and one sigma error bars are as fol¬ 
lows: 

bqbafpifi ) = 3.33^0 43 x 10 -2 °ergs -1 cm -2 A 1 arcsec -2 , 

( 8 ) 

and fl m = 0.296^0 ^ 4 . The shape parameter fl m is con¬ 
sistent with the best fit value from the Planck satellite re¬ 
sults (Ade et al. 2014, f2 m = 0.30). The 1, 2 and 3 a confi¬ 
dence contours in these parameters considered together (i.e. 
Ax 2 =2.3, 6.17 and 11.8) are displayed along with the best 
fit values in Figure [3] 

When carrying out our linear fit to the isotropically 
averaged correlation function we are neglecting non-linear 
effects and redshift measurement errors which are expected 
to suppress the correlation function on small scales (Davis 
and Peebles 1983). We are also not including in our model 
of Equation[3]the one-halo term (see e.g., Cooray and Sheth 
2002) representing structure inside virialized halos, which 
would cause the correlation function to be increased at small 
scales. An estimate of the size of the redshift-space distor¬ 
tion effects is given below. There is some evidence for sup¬ 
pression of redshift-space distortions o f the form that is ex - 
pected from radiative transfer effects dZheng et al.ll2011al 'l. 
The effect of the suppression terms in the cross-correlation 
function will be of opposite sign to the boost due to a one- 
halo term, so that the partial cancellation will further lower 
the impact of these effects. Because of these interactions, 
we leave more detailed nonlinear and halo modelling of the 
correlation function to future work. 

3.3 Clustering transverse and parallel to line of 
sight 

We also compute the quasar-Lya cross-correlation as a func¬ 
tion of t|| and rj_, the quasar-pixel pair separation along and 
across the line of sight, shown in Figure[4l on a linear scale. 
We can see that the contours are relatively symmetric about 
the r|| = 0 axis and somewhat stretched along the ry direc¬ 
tion. Font-Ribera et al. (2013) found a redshift offset be¬ 
tween quasars and the Lya forest of 5 Z = — 160kms _1 , due 
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Figure 3. Fit parameters for the amplitude b q b a fp (ji) and shape 
f2m (for fixed h and other parameters) of a linearly biased CDM 
model fit to the Lya cross-correlation function plotted in Figure 
[2] The dot indicates the best fit parameters and the contours show 
the 1, 2 and 3 a confidence contours. 

to the quasar catalogue redshifts being on average too small 
by this amount. This correlation resulted in the quasar- 
Lya forest cross-correlation being shifted upwards by this 
amount. The precision of the quasar-Lya emission cross¬ 
correlation in our paper is smaller, but visual inspection 
of Figure [4] reveals that the position of the centroid of the 
cross-correlation is consistent with a small upward shift of 
this magnitude (1.6h _1 Mpc at this redshift). 

On the scales where the cross-correlation is easy to dis¬ 
cern (r 20h~ 1 Mpc), there is no sign of compression due 
to linear infall (Kaiser 1987). The redshift-space distortions 
should be less prominent compared to the Lya forest be¬ 
cause of the expected higher bias for the Lya emission. In 
reality there appears to be stretching along the line of sight 
(we quantify this below), which might be due to a combina¬ 
tion of quasar redshift errors, the intrinsic velocity disper¬ 
sion of quasars in their host halos, or the intrinsic velocity 
dispersion of the sources of Lya emission. 

Another source of apparent clustering anisotropy could 
be the radiative transfer effects predicted by Zheng et al. 
(2011a). It was shown by these authors, using cosmologi¬ 
cal radiation hydrodynamic simulations, that Lya radiative 
transfer has a strong environmental dependence which can 
cause the apparent spatial distribution of Lya emission to 
become anisotropic with respect to the line-of-sight direc¬ 
tion. Density fluctuations along the line-of-sight direction 
are found to preferentially emit the Lya radiation in that 
direction in overdense regions, mainly because of the effect 
of peculiar velocity gradients on the Lya radiative transfer. 
This causes a suppression of the line-of-sight fluctuation, 
which can be modeled similarly to the Kaiser effect (also 
caused by the peculiar velocity gradient), even though the 
sign of the effect is opposite. 

3.4 Fitting redshift-space distortions 

In order to approximately quantify the level of distortion 
in Figure [4] and its statistical significance, we have inves¬ 
tigated fitting a redshift-space distortion model to the £ qct 
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Figure 4. Left panel: The quasar-Lya cross-correlation £ qct as a function of ry and rj_. The units (of Lya surface brightness) are 
the same as in Figure [2] The contours are spaced at values of 10 -21 ergs -1 cm -2 A 1 arcsec -2 . To reduce noise in the image, the 
dataset was smoothed with a Gaussian filter with cr = 4//' 'Mpc (2 cells) before plotting. Right panel: The model fit to the quasar-Lya 
cross-correlation including redshift-space distortions (see Section 13.411 . 


(rj_,r||) data. To compute the model for £ qa (r_L,r||), we 
first assume the linear ACDM correlation function shape 
used in Equation [3] and then use a model for peculiar ve¬ 
locities to distort it in redshift space. Our pecular velocity 
model includes standard linear infall for large scale flows 
(Kaiser 1987) and a small scale random velocity dispersion 
(e.g., Davis and Peebles 1983). 

The parameterization of the model for linear infall al¬ 
lows for stretching (outflow) as well as squashing (infall) 
along the line of sight. Although gravitational process- 
eses are expected to result in infall, as mentioned above 
Zheng et al. (2011b) have shown that radiative transfer ef¬ 
fects on the anisotropy of clustering can be approximately 
parametrized with the same model. We do this here, allowing 
a net linear outflow measured by the model to be interpreted 
as the radiative transfer effect. 

The effects of coherent flows on the correlation function 
in linear theory were presented by Hamilton (1992). We use 
the formulation of Hawkins (2003), with modifications to 
make it appropriate for the case of cross-correlation func¬ 
tions. This modification involves the use of the two bias 
factors b q and b a from Equation [3] to compute redshift- 
space distortion factors /3 q = fl m (s = 2.55)°’ 6 /fo q and 
P a = flm (z = 2.55 ) 0 ' 6 /b a - The linearly distorted quasar- 
Lya emission cross-correlation function is then given by 


£qa(n-,G|) = bqb a (n<x ) [6 (s)Po (aO + 6 (s )Pi (/Li) + £4 (s)P 4 (n )] . 

(9) 


where n = ry/r, and 

£o(«) = 


1 + -(/3q + Pa) + ~PnPa 


Z(r) (10) 


6 («) = 


(Aj + Pa) + y/3q/3a 


Kto-£(r)], (11) 


with 


&(s) 



O' 


£(?) + ^(r) 






£(r) = 


_5_ 



i dr'. 


( 12 ) 

(13) 

(14) 


Here £(r) is the linear ACDM correlation function of equa¬ 
tion 

We use these relations to create a model £ qct (r\i., ry) 
which we convolve with the distribution function of ran¬ 
dom pairwise motions, f(v), to produce the final model 
£qa(r_L,r||): 


£qa(r±,ry) = 


„) = />( 


rx,ry 


(1 + z)v 
H(z = 2.55) 


f(v)dv 


(15) 

The random velocity dispersion we use is an exponential 
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Figure 5. 1, 2 and 3 cr confidence contours for the redshift-space 
distortion parameters (3 and a found from the fit to the quasar- 
Lya emission cross-correlation (Section 3.4). The best fit values 
of /3 and cr are shown with a dot. 

model, thus the distribution function of velocities is 

(w) 

where a is the pairwise velocity dispersion of quasars and 
Lya emission, which we assume to be independent of pair 
separation. 

There is a strong degeneracy between /? q and /3 a , but 
the bias factor for BOSS quasars is reasonably well mea¬ 
sured. The first BOSS measurement, of b q = 3.6 ± 0.6, was 
made using the quasar autocorrelation function by White 
et al. (2012). Font-Ribera et al. (2013), find an even more 
precise value of b q = 3.64lto 15 from the cross-correlation of 
BOSS quasars with the Lya forest. Because of this we set 
6 q = 3.64, giving /3 q = 0.27. The free parameters in our 
distortion model are therefore fi a , a and an amplitude pa¬ 
rameter b q b a (na )• 

We set the parameter governing the shape in our model, 
S2 m , equal to the Planck value, = 0.30. We compute 
( qo (rj_,r||) for our model for a grid of values of varying 
Pa, cr, and b q b a {fj,). We then compare our model to the ob¬ 
served £ qQ , (t"_l , yn) from Figure 2] performing a \ 2 fit to all 
points within r = 40h“ 1 Mpc. We again use jackknife error 
bars computed from 100 subsamples, but because of diffi¬ 
culties with noisy matrix inversion, we do not use the off- 
diagonal (Cijtj) terms when inverting the covariance matrix. 
We marginalize over the amplitude parameter b q b a {Ha ), 
and show the confidence contours (for the two remaining 
degrees of freedom) in Figure [5] 

We have allowed the p a parameter to be negative in our 
fit not because we believe that outflow of Lya emission is 
likely around quasars but, as mentioned above, because this 
allows quantifying the stretching along the line of sight seen 
in Figure U and its significance. Our best fit values with 1 a 
error bars are /3 a = —0.76 ±0.36 and a = 490 ± 300 km s^ 1 . 
From these values and by observing the contours in Figure 


[5] we infer a detection of anisotropies in the quasar-Lya 
emission correlation function that is opposite in sign to that 
expected from peculiar velocity flows due to gravitational 
evolution: our constraint on p a is 2.1 ct from p a = 0. We 
interpret this result as indicating that there are strong non- 
gravitational effects on the Lya emission causing the elonga¬ 
tion of the cross-correlation contours along the line of sight 
extending to large separations; the good fit obtained with 
the redshift-space distortion model with negative ft needs to 
be understood in this case as a coincidence, since the model 
is not physically correct. 

If we limit the fit to points with 20 /i -1 Mpc, we find 
central values for /3 a and a that are consistent with those 
for our fiducial fit, As might be expected, the error bars are 
larger, however, by a factor of 40% for a and 65% for P a - 

If we assume a Lya emission bias factor b a = 3, corre¬ 
sponding to highly biased star forming galaxies, (see Section 
Ofor further discussion of this value), then p a = 0.32 at red- 
shift z — 2.55. For the expected value of a, we can use as 
a guide the results of Font-Ribera et al. (2013) who con¬ 
strained a < 370 km s -1 at the 1 cr confidence level from the 
redshift-space quasar-Lya forest cross-correlation function. 
We can see from FigurcOthat although the cr measurement 
is consistent with Font-Ribera et al. , p a and cr considered 
jointly disagree at the 2.5<r level from our measurement. 

We discuss further in Section [5] how our measurements 
can be interpreted as a detection of clustering anisotropies 
due to radiative transfer effects (Zheng et al. 2011b). In this 
case, elongation along the line of sight is expected, which 
can explain the effective measurement of a negative p a pa¬ 
rameter. 

The cross-correlation contours of the best fit model are 
plotted in Figure [4] (right panel), which also reveals stretch¬ 
ing along the line-of-sight. The \ 2 value for the fit is 610 
measured from 400 bins, with 3 free parameters, a reduced 
X 2 of 1.5. The fit is therefore not good, and the discrepancy 
arises in large part in the central region, where the model has 
lower surface brightness than the observations. This result 
may be a sign that adding a one-halo term to the correlation 
function would provide a better fit, and should be addressed 
in future work with a larger data sample. 

3.5 Large-scale tests 

The cross-correlation across and along the line of sight over 
the whole spectrum offers a way to test whether the detected 
signal is reasonable. One can search for any significant cross¬ 
correlation signal if a different wavelength other than Lya is 
used, which would indicate either contributions from other 
emission lines, or that other effects are causing the cross¬ 
correlation signal meaasured. An equivalent approach is to 
extend the line-of-sight range of the cross-correlation to large 
distances. Figure [6l extends the contours shown in Figure [4] 
to much larger scales. The positive signal seen in Figure[5]is 
the most significant feature, centered at rn = 0, r± = 0. This 
is a good indication that Lya emission is the dominant con¬ 
tribution to our signal. Signal from lines at longer (shorter) 
wavelengths than Lya would appear at positive (negative) 
values of r§. 

The second most prominent feature, at r± = 0, rii ~ 
60h _1 Mpc, is significant at the ~ 1.5<r level (the pixel at 
the center of the feature is 1.5 a from the zero level, using 
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Figure 6. The quasar-Lya cross-correlation £ qct as a function 
of r|| and t_l . This figure shows the same information as Figure 
m except over a larger range of scales (see text). 


Table 1. The amplitude parameter b q b a fp{p) for different red- 
shift bins 


<*> 

^min 

^max 

bqb a fp{p) (using S2 m = 0.30) 

( 10 -20 erg s -1 cm -2 A -1 arcsec -2 ) 

2.20 

2.0 

2.29 

5.0t“ 

2.37 

2.29 

2.46 

4-0±g;| 

2.59 

2.46 

2.75 

O 4+0.8 

0.9 

3.04 

2.75 

3.50 

2 q+ 0.8 
z - u -0.8 


the Jackknife error bars from Section [3.411 . and so is consis¬ 
tent with noise. Strong lines that might be an issue, such as 
Lyman-/3 and Carbon-IV, are very far away (at rii = —490 
and +640 /i -1 Mpc respectively) and so are not a concern. 
Si-III would appear at r« = —22/i^ 1 Mpc if it was present. 


3.6 Evolution with redshift 

The redshift coverage of our data sample (z = 2 — 3.5) is suf¬ 
ficient that we can separate it into different bins in redshift 
and search for evolution. We do this for four different red¬ 
shift bins where each bin contains one quarter of the quasar 
data. The bin boundaries and mean quasar redshifts of each 
bin are given in Table [I] 

The quasar-Lya emission cross-correlation results are 
shown in the four panels of FigureQ The global CDM model 
fit to the full sample averaged over all redshifts is indicated 
as the dash-dot line in every panel. Although the results for 
the redshift bins are relatively noisy, as expected, they are 
broadly similar to the global result in shape and amplitude 
and show no clear evidence for any redshift evolution. 

Figure [8] presents the best fit CDM shape and amplitude 


parameters from Section l372l along with confidence contours. 
The fitting was again performed using jackknife error bars. 
Figure [5] demonstrates that the fiducial model results all lie 
within the 2 ct confidence contours for the different redshift 
bins. 

Assuming that the shape of the cross-correlation func¬ 
tion remains fixed in comoving coordinates (as it would do 
if governed by linear biasing), we can search for changes 
in the amplitude of clustering and the mean Lya surface 
brightness as a function of redshift. We set = 0.30 (the 
CDM shape determined by the Planck results; Ade et al. 
2014) and then determine the best fit amplitude parame¬ 
ters, b q b a fp{[i ), at each redshift bin. The result, shown in 
Figure [9] indicates a decreasing cross-correlation amplitude 
with redshift, although the errors are large and a horizon¬ 
tal line would not be an unreasonable fit “by eye”. To ex¬ 
press this quantitively, we have carried out a \ 2 fit to the 
function log {b q b a fp{fj))= a + bz, finding a slope parameter 
b = —0.40 ± 0.20, meaning that the hint of redshift evolu¬ 
tion is significant at the 2 .Oct level only. The values for the fit 
parameters b q b a fp(fj,) for the different redshift bins (which 
were used to plot Figure [8] are listed in Table [T] 

We have also looked at redshift evolution of the CDM 
shape governed by the parameter fl m . Within the assump¬ 
tion of linear biasing, the shape should remain constant with 
redshift. The results are examined in Figure [TOl where the 
results are indeed consistent with a constant within the 
uncertainties. The Planck value (fi m = 0.30) is also shown 
and is consistent with our results. 


4 EMISSION LINES 

Given that the sources of the clustered Lya emission seen 
in Section 3 could be discrete objects such as star form¬ 
ing galaxies, we must investigate see if individual emission 
lines can be detected in our spectra and whether discrete 
detectable lines can account for the signal. The BOSS spec¬ 
tra have relatively short integrations on a small (2.5 m) 
telescope, and cannot be expected to compete in individ¬ 
ual detections with other surveys such as that described in 
Cassata et al. (2011): we can only detect the most luminous 
objects. However, our cross-correlation technique enables us 
to find the mean total surface brightness, which includes 
all emission-line objects no matter how faint they may be. 
The difference between the cross-correlation signal with and 
without individually identified lines therefore enables us to 
discover what fraction of the Lya surface brightness lies be¬ 
low our line detection limit. 

We note that our line detection procedure is less sophis¬ 
ticated than that in the BELLS survey (Brownstein et al. 
2012), which used line detections of galaxies behind LRGs 
to find gravitational lenses. In particular, we are not seek¬ 
ing confirmed detections of objects (which requires multiple 
emission lines) and we do not deal with interlopers, except 
statistically. 


4.1 Line fitting 

For each LRG spectrum (see Section 2.1), we subtract the 
best fit galaxy spectrum model, as we do in our fiducial 
cross-correlation analysis. We then fit lines to this residual 
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Figure 7. The quasar -Lya emission cross-correlation function ^ qQ: (^) for different redshift ranges, which correspond to one quarter of 
the full dataset each. The solid lines indicate the best fitting CDM model at each redshift bin and the dash-dot line represents the best 
fit to the full dataset (shown in Figure [3}. Both the shape and amplitude of the model fit are consistent at the 2 a level with no evolution 
over the redshift range (this is shown in Figures l9l and fTol ). 


CT> 



Figure 9. The evolution of bqb a fp(fi) with redshift. The results 
from Figure [7] are used, but here we have fixed the CDM shape 
to one for a ACDM model with f2 m = 0.30 (see Section m 
. The error bars are I a and are estimated using the maximum 
likelihood fit to the amplitude of the CDM model. The solid line 
is a log-linear fit to the data (see text). 


flux, centering our fitted line profile on the center of each 
spectrum pixel, one at a time. In this first stage, each spec¬ 
trum pixel is therefore the center of a best fit line profile 
- we remove overlapping lines later. For each of a grid of 
values of amplitude A and line width a (between 0 and 
20 A) we compute the yf value of a positive Gaussian 
emission line profile G(x), where the profile has the form 
G(x) = A exp —(x 2 /2a 2 ) and x is the separation between 
the line center and the pixels we include in our fit. We use 
pixels in our fit that are in a 40 A region centered on the line 
center, excluding masked regions as in Section 2.1.1. Once 
the best fitting values of A and a are found, we estimate the 
significance of each fitted line from the x' 2 difference between 
the line fit and a flat interval with zero flux. After fitting to 
all the pixels we pass through the list and eliminate overlap¬ 
ping lines, removing the lower significance line when there 
is an overlap. 

We find 5200 lines with a nominal significance of 5 a 
(Ax 2 > 30.1 for the 2 degrees of freedom fitted), and 
1.6 x 10 6 lines with a nominal significance of 3 a (Ax 2 > 11.8) 
There are 1.3 x 10 9 pixels in the search regions of the spec¬ 
tra. The detected lines are constrained to be at least 40 A 
(37 pixels) apart, but for rare lines this should not change 
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Figure 8. Likelihood contours for power spectrum parameters f2 m (which we are using to parametrize the CDM shape) and bqbocf /3 (ji) 
(the amplitude) for different redshift ranges. The filled dot shows the best fitting values for that redshift range and the open circles the 
best fitting values for the full dataset (shown in Figure 0. 


the random expectation. One would therefore expect to find 
approximately 370 and 1.8 x 10 6 5 a and 3<r lines respectively 
from positive noise fluctuations alone. 

For the 3<r lines, the fact that we have detected fewer 
lines than even pure noise fluctuations predict is likely to be 
a sign that the fluctuations do not exactly obey a Gaussian 
noise model. An additional complication is that the noise 
estimate from the standard data pipeline which we have 
used has been shown to be underestimated by up to 16% 
for the relevant wavelengths (Palanque-Delabrouille et al. 
2013). The detection of more 5<r lines than randomly ex¬ 
pected is likely to indicate that there are false detections 
arising from unsubtracted features in the galaxy spectra, 
sky lines we have not accounted for, and other systematics. 
There is also the possibility of interloping [Oil] emission lines 
from lower redshift galaxies (see Noterdaeme et al. 2010, 
Menard et al. 2011). For our wavelength coverage of 3800- 
5500 A, this interloper emission would arise from between 
z = 0.02 and 2 = 0.48. Without a significant additional ef¬ 
fort to remove false detections and interlopers, our dataset 
is not useful for computing the luminosity function of Lya 
lines. Instead, we turn to the statistical cross-correlation to 
test for the fraction of these lines which are really Lya emis¬ 
sion lines in our redshift interval. 


4.2 Cross-correlation 

We subtract the flux in the lines detected in Section 4.1 
from each LRG spectrum, and then recalculate the cross¬ 
correlation of quasars and Lya emission (Equation [T]). The 
results for £ qct (r) (again computing the error bars using a 
jackknife estimator and 100 subsamples) are shown in Fig¬ 
ure EL using our two thresholds on the significance of the 
removed lines, 5<r and 3cr. We can see that in each case, the 
clustering signal is still visible and the shape traces that of 
a CDM curve. This shows that most of the surface bright¬ 
ness of Lya emission is not accounted for by these lines. 
As expected, most of the lines are due to noise features. 
By subtracting these lines, however, we are also subtracting 
any possible real lines, and the change in the amplitude of 
( qa (r) is a measure of the fraction of surface brightness that 
is actually contributed by strong lines. 

The shape and amplitude fitting parameters (fl m and 
bqb a fp{tj) ) for these two cases (> 5 a and > 3cr lines sub¬ 
tracted) are shown in Figure IT?! The shape parameter O m 
is very similar in the two cases and almost the same as in 
the fiducial case. The amplitude is lower, as would be ex¬ 
pected for subtraction of some real lines, but the fiducial 
result (with no line subtraction) lies well within the 1 er er¬ 
ror contour of both of the panels in Figure [12] implying that 
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Figure 10. The value of f2 m (which we are using to parametrize 
the shape of the ACDM correlation function, holding other pa¬ 
rameters fixed) vs redshift. The results from Figure[7] were used. 
The solid line is the best fit to the Planck results (Ade et al. 
2014), = 0.30. 

the contribution to the cross-correlation from emission lines 
that are detected and removed is not statistically significant. 
Quantitatively, this can be seen by considering that we find 
the amplitude parameter for the > 5a case to be b q b a fp{fi) 
= 3.18lo’4i x 10 -20 ergs -1 cm -2 A -1 arcsec -2 , and for the 
3a case to be b q b a fp(ij) = 2.89to'37 x 10 -20 ergs -1 cm -2 
A -1 arcsec -2 . The amplitude parameter is therefore 4±12% 
and 13l!li3% lower than the fiducial case for the > 5a and 
> 3a line removal cases, but both of these are consistent 
with zero within the errors. The analysis is therefore consis¬ 
tent with our line fitting having found no true Lya emission 
lines at all. 

In order to relate the significance levels to line lumi¬ 
nosity, we have computed the luminosity from the surface 
brightness for each line (bearing in mind that our mea¬ 
surements are restricted to a 1 arcsec radius fiber aper¬ 
ture). We find that the median luminosity of the > 5a 
lines is L — 9.0 x 10 42 ergs -1 and the > 3a lines have 
a median luminosity L = 1.9 x 10 42 ergs -1 . We can com¬ 
pare these luminosities measured with some published val¬ 
ues from Lya emitter surveys. The flux limit of the Guaita 
et al. (2010) data sample was 2 x 10 -17 ergs -1 cm -2 (emis¬ 
sion line flux) at z=2.1. This corresponds to a Lya luminos¬ 
ity of 5 x 10 41 ergs -1 . For the Gawiser et al. (2007) sample 
at z=3.1, the line flux limit was 1.5 x 10 -17 ergs -1 cm -2 , 
corresponding to a line luminosity of 1.3 x 10 42 ergs -1 . 

In our calculation above we have seen that at the 1 a 
confidence level, 13% of the Lya cross-correlation signal in 
our dataset could be due to lines with a median luminosity 
(in a 1 arcsec radius aperture) 1.9 x 10 42 ergs -1 . This is 
similar to the values of Gawiser et al. and Guaita et al. , 
although given our small aperture, the intrinsic luminosity 
of our fitted line emitters will be even higher. A small frac¬ 
tion of the b q b a fp{ii) value we are seeing could therefore be 
contributed by emitters similar to those in these two sur¬ 
veys. As noted above, the error bar on this fraction is large, 
and our result is consistent with zero contribution from such 
lines. 

There are various possibilities for the nature of the ma- 
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Figure 11. The quasar-Lya emission cross-correlation function 
£qa(r) (see Equation [TJ. as in Figure [2] but computed after 
subtracting emission lines that are apparently detected in the 
spectra at the 5a significance level in panel (a), and the 3a level in 
panel (b). The smooth curve is the best fit linear CDM correlation 
function (see Section I3.2D and the dash-dotted line is the best fit 
CDM curve for the fiducial sample (i.e., before subtracting the 
apparent emission lines). 


jor contributor to the Lya cross-correlation signal. The first 
is fainter lines than those seen in Lya emitter surveys. This 
is unlikely because b a is luminosity weighted, so low lumi¬ 
nosity lines contribute much less to the signal than higher 
luminosity lines. The second is high luminosity, low surface 
brightness emission. This emission is much more difficult to 
detect, and cannot be seen by our line search algorithm, 
which is sensitive to high surface brightness, narrow lines. 
It is also unlikely to have been seen in previous surveys (we 
return to this in Section |6j. This type of emission would also 
be highly biased, and so if present would contribute strongly 
to the bqb a fp{n) measurement. We discuss the various pos¬ 
sibilities in more detail in Section [6] 
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Figure 12. The effect of removing strong lines on the shape and 
amplitude of the quasar-Lya cross-correlation. We show the fit 
parameters for the amplitude b q bq —Lyae (l ! Lyry) and shape Q ni 
of a linearly biased CDM model fit to the Lya cross-correlation 
function plotted in Figure 1111 Panel (a) is after removing > 50- 
significance lines and panel (b) > 3o significance lines. The dots 
indicate the best fit parameters and the contours show the 1,2 and 
3 cr confidence contours on the fit parameters. The open circles 
show the best fit results to the fiducial results (from Figure 0. 


5 STAR FORMATION RATE DENSITY 

If the Lya surface brightness we are seeing is produced by 
star forming galaxies, we can convert the mean Lya surface 
brightness into a star formation rate density (SFRD) at the 
mean redshift z = 2.55 of our observations. Traditionally 
(e.g., Gronwall et al. 2007) narrow band surveys have been 
used to detect Lya emitters, compute their luminosity func¬ 
tion, and integrate it to compute a mean Lya luminosity 
density t a before converting it into a star formation rate, 
using a relationship such as: 

SFR/( Mq yr -1 ) = L Q /(1.1 x lO^ergs” 1 ) (17) 

(Cassata et al. 2011), where L a is the Lya luminosity. The 
conversion factor is based on a stellar population with a 
Salpeter IMF and with no correction for effects like dust 
and escape fraction, which is accurate to within a factor of 
a few for a range of population a ge, high mass cutoff of stars, 
and metallicity (iLeitherer et al.lll999l ). 

This method assumes that the surveys of Lya emitters 



Figure 13. The star formation rate density (psfr) inferred from 
our measurement of the mean Lya surface brightness in the Uni¬ 
verse between z = 2 — 3.5 (see Sect ion 0 is shown as the red point 
with solid line error bars, assuming that the linear bias factor for 
Lya emission is b a = 3, a reasonable value for the luminosity- 
weighted clustering of star forming forming galaxies (see Section 
01. The true value of b a is unknown, so this data point should 
be scaled by 3 /b a . Other data values plotted with open (black) 
symbols are from published Psfr values which used UV estima¬ 
tors. The solid (blue) points show estimates of Psfr computed 
from the luminosity functions of surveys for Lya emitters. The 
references are given in Section 0 The shaded area represents the 
range of dust corrected UV estimates compiled by Bouwens et al. 
( 2010 ). 


are able to capture all the radiation from young stars. How¬ 
ever, these surveys can only detect the high surface bright¬ 
ness portion of sources within a small angular aperture, and 
may be missing much of the Lya line intensity when it is 
scattered far out into the galaxy halo. In our case, the sta¬ 
tistical cross-correlation technique we are using should not 
be affected by any threshold in Lya surface brightness. We 
should therefore be able to compute the total star formation 
rate density from our measurement. One large uncertainty 
is absorption due to dust, which is known to significantly 
affect UV continuum and line estimators of star formation. 

Recall that our measurement is of the quantity 
bqb a fp{fj), so to compute the Lya surface brightness we 
need to have independent knowledge of 6 q , b a and fp. For 
the quasar bias factor we use the va lue measured for BOSS 
quasars bv iFont-Ribera et alJ ll2013l) . b q = 3.64+0^5- 

We recall that the bias factor b a is related to a 
luminosity-weighted bias factor from the definitions in 
Equations 00 (fe a = in the absence of radiative transfer 
effect), and is different from the number weighted bias 
factor of Lya emitters, &lab- To understand the difference 
in the values of the two bias factors, we start with the fol¬ 
lowing simple model for the Lya emission. 

If we assume that there are ( N(M )) galaxies per dark 
matter halo of mass M and that Lya emission comes from 
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galaxies in halos above a mass limit M m j n , then the spatial 
bias can be computed as follows (e.g. iBerlind fe Weinberd 

l2002h : 

(18) 

where 6h(M) is the bias factor for halos of mass M and 
dn/dM is the halo mass function. For the luminosity 
weighted bias factor, we have 

- LI MvwflS"/. XX L ^m dM • 

(19) 

where L(M) is the average Lya luminosity in halos of mass 
M. 

Observationally, a value of 6 lae = 1-75 ± 0.23 results 
from an average of the values measured by Gawiser et al. 
(2007) and Guaita et al. (2010), who find 6 lae = 
at z = 3.1 and &lae = 1.8 ± 0.3 at 2 = 2.1, respectively. 
Using Equation 1181 we find that 6 lae = 1.75 corresponds to 
M m i n = 10 11 Mg. In the calculation, we assume one galaxy 
per halo = 1), which overe stimates M m j n by on ly 

a sm all factor (see the Appendix of lZheng. Coil, fe Zehavil 
l2007h . The mass estimation is consistent with, e.g., the anal¬ 
ysis of lGawiser et al.l (120071 ") . 

To estimate the luminosity-weighted bias factor we 
need to know the relation between luminosity and halo mass. 
Assuming that L(M) oc M p , for M m i n = 10 11 Mg we ob¬ 
tain = 2.38, 4.40, and 6.84 for the values p = 1, 2, and 
3, respectively. The differences between 6lae and are 
therefore substantial, the latter being usually much larger. 
To proceed further, we need to consider the likely relation 
between L and M. The Lya luminosity is related to the 
star formation rate in galaxies. Along the star-forming se¬ 
quence, the star formation rate is inferred t o be approx¬ 
imately proportional to th e stellar mass (e .g. iDaddi et al.l 
120071 : IPannella et al.l 120091 ; iLee et all l201ll ). We therefore 
first make use of the relationship between halo mass and stel ¬ 
lar m ass found with abundance matching by iMoster et al.l 
d20ld) . It is in the form of a softened broken power law, 
and at 2 ~ 2.55 the low-mass-end (high-mass-end) slope of 
the L-M relation is ~ 2.5 (~ 0.6) with a transition mass 
around 10 12 Mg. Using this relation in Equation 1191 yields 
= 2.82 for M m i n = 10 11 Mg. In fact, the result is insen¬ 
sitive to M m i n (&l = 2.81 for M m i n = 10 9 Mg), given the 
steep L-M relation (so low-mass halos are weighted less). If 
we modify the high-mass-end slope to ~ 1 to approximately 
account for the luminosity contribution from satellite galax¬ 
ies, we obtain ftj, = 3.15. 

There are various uncertainties involved in deriving fen 
with this simple model. First, the slope of the star forma¬ 
tion rate versus the stellar mass relation for the star-forming 
galaxy sequence can be slightly different from unity. We find 
that a 10% deviation from unity in the above slope leads to a 
~ 5% change in the value of for,. Second, we assume that the 
Lya luminosity is proportional to the star formation rate. 
The way these two quantities track each other may vary as 
a function of star formation rate if, for example, the escape 
fraction of Lya photons varies. Another possibility is that 
a large fraction of the Lya emission comes from previously 
undetected sources or low surface brightness halos around 
galaxies. These factors will change the L-M relation and 



Figure 14. Probability distribution functions (PDFs) of Lya lu¬ 
minosity density (solid curves) and halo-bias-weighted Lya lumi¬ 
nosity density (dashed curves) for our fiducial model. The PDFs 
are computed as proportional to the average Lya luminosity or 
halo-bias-weighted Lya luminosity in halos of mass M multiplied 
by the differential halo mass function dn/d log M. The blue curves 
use only the Lya luminosity of central galaxies, and the red curves 
include contributions from satellite galaxies. All curves have been 
normalized to unity at their respective maxima. See the text for 
further details concerning the model. 


therefore the derived value of Even with the above un¬ 
certainties, it is likely that for, is around 3. 

With the above model, we can compute the contribu¬ 
tion to the Lya luminosity density from halos of different 
masses, which is simply proportional to the average Lya 
luminosity in halos of mass M times the differential halo 
mass function dn/d log M. The solid blue curve in Figure [141 
uses central galaxy Lya luminosity only, which peaks around 
log (M/Mg) = 12.25. Including the contribution from the 
satellite galaxies shifts the peak slightly to a higher mass, 
around log(M/Mg) = 12.45, as shown by the solid red 
curve. The curve gives the probability density of a random 
Lya photon to come from a halo of mass M. The Lya emis¬ 
sion detected through the cross-correlation technique probes 
the halo-bias-weighted luminosity density distribution. The 
dashed curves in Figure [T4] show these probability distri¬ 
butions. With the satellite contribution included, the curve 
peaks around log(M/Mg) = 12.6. Taking the full width at 
half maximum of the curve, the fiducial model implies that 
the signal in the cross-correlation should mainly come from 
Lya emission in halos of mass (1-20)x 10 12 Mg. 

Finally, as mentioned in Section [3. 2 1 bi, is not the same 
as b a once the radiative transfer effect is taken into account. 
A simple model shows that b a = bL + ai with ai a posi¬ 
tive number (see Section 16.41) . The value of ai is not readily 
known without detailed radiative transfer modeling. Overall, 
we expect b a to be larger than ~3. We choose to parame¬ 
terize derived quantities in terms of (3 /b a ). 
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An additional uncertain factor to consider is the modi¬ 
fication to clustering caused by redshift-space distortions. 
This is embodied in the fp parameter. We have seen in 
Section El that measurements of anisotropies in cluster¬ 
ing give a measurement of /3 a = ^m 6 (" = 2.55)/6 a = 
—0.76 ± 0.36. This negative value of f3 a is of the form ex¬ 
pected to be caused by ra diative transfer effects on clus¬ 
tering uZheng _et_alj l2011aft and is opposite in sign to the 
usual iKaiserl ( 1987ft peculiar velocity redshift-space distor¬ 
tions. Neverthless, this redshift-space distortion model was 
shown in Section El to give a reasonable fit to the data 
and one can use this to compute the factor fp as fp = 
1 + j(/3q + /3 a ) + |(/ 3q/3 a ) (from equation [10]). If we do this 
we find that the value of fp = 0.80±0.15, which we take as a 
reasonable estimate of the reduction of the monopole term 
due the spreading the correlation along the line of sight. 
Even though gravitational evolution is not the physically 
correct model for interpreting our observations owing to the 
negative value we obtain for f3 a , a model with redshift-space 
distortion plus radiative transfer effect does seem to work 
reasonably well here (see Section T6.4ll . 

We use this value and propagate the errors from the 
bias measurements and our measurement of b q b a fp{n) (for 
fixed shape parametrized by = 0.30), to compute the 
mean Lya surface brightness at z = 2.55, finding 


(p, Q ) = (3.9±0.9) x 10 21 (3 /&q.) ergs 1 cm 2 A 1 arcsec 2 . 

( 20 ) 

We convert this into a comoving Lya luminosity density 
t a using 


e a = 47r( Ma )^A Q (l + z) 2 , (21) 

c 

where c is the speed of light and X a = 1216 A. We find 
the value e a = 3.1 x 10 41 (3/& a ) erg s -1 Mpc -3 . We then 
use Equation [17] to convert this into a measurement of 
the star formation rate density psfr(z = 2.55) = (0.28 ± 
0.07)(3/6 a ) Mgyr -1 Mpc -3 . As mentioned before, the con¬ 
version depends on the assumption about the underlying 
stellar population. A younger population and lower metal- 
licity would lead to a lower inferred SFR, which could be an 
important effect for interpreting our results. Keeping this 
possibility in mind, we proceed with the discussion by us¬ 
ing the above result from the commonly adopted conversion 
factor. 

We plot this result in Figure IT31 as the red point, for the 
chosen value of b a = 3. We note that the true value for the 
parameter b a is not well determined, and that the psfr dat- 
apoint should be scaled up and down by the factor 3/b a - Our 
discussion above suggests that b a is likely to be larger than 
3, so that it is more probable for psfr to be scaled down¬ 
wards than upwards. Figure IT51 also shows various estimates 
of psfr from Lya emitter surveys as well as UV continuum 
estimates of the star formation rate. We can see that our 
measurement is about 30 times higher (for b a = 3) than the 
Lya emitter based measurements of Gronwall et al. (2007), 
Ouchi et al. (2008) or Cassata et al. (2011). Note that these 
Lya emitter based measurements result from a direct inte¬ 
gration over the observed Lya luminosity function without 
corrections for any possible dust effect. 

A complication which adds substantial uncertainty is 
dust absorption, which may affect Lya and continuum ra¬ 


diation differently. One estimator of the level of dust ex¬ 
tinction in the continuum is the rest-frame UV continuum 
slope, /?, which specifies how the flux density of a galaxy 
varies with wavelength (i.e., f\ oc A 3 ) in the UV contin¬ 
uum region (~ 1300A to ~ 3500A) of its spectrum. If an 
intrinsic /1-dust extinction relationship is assumed, (usually 
that measured empirically from z ^ 0 galaxies by Meurer 
et al. 1999), one can use observations of j3 for high redshift 
galaxies to infer a dust-corrected UV luminosity and then 
star formation rate. This has been done by several authors, 
including Bouwens et al. (2009,2010). In Figure fl3l we show 
as a blue band the compilation of dust-corrected UV star 
formation densities from Bouwens et al. (2010) computed 
using this technique. According to Bouwens et al. (2010), 
the dust correction for a limiting luminosity of 0.3 L*_ 3 is 
6.0±2.5 at redshift z = 2.5. Support for the validity of these 
corrections comes from the agreement of dust-corrected UV 
values with psfr estimated from infrared observations (see 
the recent review by Madau & Dickinson 2014). 

As for the effect of dust on the Lya radiation, Cassata 
et al. (2011) have speculated that it could be even stronger 
than proposed by Bouwens et al. (2010), as the Lya emitter 
inferred psfr is less than 20% of the non-dust corrected UV 
continuum value (as can be seen in Figure fT3T) . One reason 
which favors this interpretation is the fact that resonantly 
scattered radiation does have to cover a longer path length 
than continuum radiation before it leaves a galaxy so that 
it could be more vulnerable to dust extinction. On other 
hand, there is one well known mechanism (Neufeld 1991) 
which could preferentially enhance the escape of Lya radia¬ 
tion: in a clumpy medium of dusty clouds, continuum (UV) 
photons are absorbed as soon as their path crosses an op¬ 
tically thick dust cloud, whereas Lya photons can bounce 
off the cloud surfaces and find their way through the clouds 
to escape, leading to a lower extinction for Lya than for 
continuum photons if the intercloud medium is sufficiently 
devoid of dust. The an isotropic escape of Lya radiation 
dZheng fe Wallacell2014ft caused by, for example, a bipolar 
galactic wind, can also help make Lya photons follow the 
path of lower extinction optical depth to escape, while UV 
continuum photons are emitted isotropically and on average 
experience more extinction. From Figure [Till it appears that 
some mechanism of this sort is needed if we are to explain 
our results. We discuss these issues further in Section [6] 

In conclusion, the rather surprising result seen in 
Figure [13] is that the fiducial value of the Lya surface 
brightness from our measurement is consistent with all 
Lya photons produced in stars at z = 2.55 escaping 
from their host galaxies and being detected. The dust- 
corrected results of Bouwens et al. (2010) imply psfr = 
0.19^(o '06 Moyr -1 Mpc -3 at z = 2.55, and from our mea¬ 
surement, psfr = (0.28 ± 0.07)(3/bo.) M 0 yr _1 Mpc -3 . This 
means that b a > 3.0 is needed for our measurement not to 
imply detection of more Lya photons than are actually pro¬ 
duced at more than the 1 — a level. Our simple model for 
b a does satisfy this limit. We note that the intensity map¬ 
ping technique we use in this paper will detect Lya pho¬ 
tons which are scattered into our sightline from arbitrarily 
large distances from the emitting galaxy, and at arbitrarily 
low surface brightness. One can characterise our measure¬ 
ment as being consistent with a “total escape fraction” of 
Lya photons from star-forming galaxies of 100 %. This to- 
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tal escape fraction could be much higher from the “detected 
escape fraction” measured from traditional surveys of Lya 
emitters, which have surface brightness limitations. We also 
discuss this in more detail in Section [6] 

6 DISCUSSION 

We can frame further discussion of our results in terms of 
the following questions: 

(i) Can the observed Lya surface brightness be explained 
by known Lya emitters? We have seen in Section [5] that the 
simple answer appears to be no. 

(ii) Can the observed Lya surface brightness be explained 
by faint Lya emitters below the threshold of published sur¬ 
veys? Ouchi et al. (2008) have shown that changing the ex¬ 
trapolated luminosity function faint end slope a from —1.0 
to —2.0 changes the total integrated Lya luminosity den¬ 
sity they infer by only 20%. The answer to this question is 
therefore also no. 

(iii) Can the observed Lya surface brightness be ex¬ 
plained by extended halos around the known Lya emitters? 
We show in Section 16.3.51 below that this is not the case, 
and that extended halos around known Lya emitters, while 
adding to the mean surface brightness, fall short of account¬ 
ing for our results by an order of magnitude . 

(iv) Can the observed Lya surface brightness be ex¬ 

plained by star-forming galaxies, when these are estimated 
based on the extinction-corrected UV continuum SFR. den¬ 
sity? We have seen in Section [5] that there appears to be 
just enough star formation per unit volume in the Universe 
at z — 2.5 that if most of it led to escaped Lya emission, 
this could explain what we are seeing. We further discuss 
the implications of this below in Sections and 17721 

(v) What is the contribution to our meaasurement of 
sources of Lya emission beyond star-forming galaxies? We 
address this in Section ib. 31 below, showing that other contri¬ 
butions are likely to be very small. 


6.1 Potential systematic errors in the 
measurement 

Our measurement of Lya intensity clustering relies on statis¬ 
tical cross-correlation techniques applied to a large sample of 
spectra with relatively low signal to noise, all of which were 
targeted at bright foreground galaxies that we have removed 
in post-processing. There is therefore ample scope for small 
instrumental or other effects to influence the signal we mea¬ 
sure. Bearing this in mind, we have carried out a range of 
tests, detailed in Appendices A-C, to make as certain as pos¬ 
sible that the signal is real. Most importantly, these include 
tests of our methods for eliminating contaminating light 
from neighboring fibers, which does have a strong effect. 
We have also tested the effect of eliminating quasars clus¬ 
tered with the quasars we are using in our cross-correlation, 
and we have checked the dependence of the signal on the 
luminosity of the galaxy fiber target and on the quasar lu¬ 
minosity. For these latter tests, we have found no significant 
effect and therefore conclude that the signal is real, to the 
extent we have been able to ascertain. 


There remains the possibility that some previously un¬ 
known effect, instrumental or otherwise, is responsible for 
the cross-correlation signal. This clustering signal would 
have to have a shape and amplitude consistent with ACDM, 
yield a star formation rate density consistent with dust- 
corrected UV estimates, have clustering distortions of the 
form predicted by Lya radiative transfer effects, and pass 
all the tests mentioned above. This would constitute an un¬ 
lucky coincidence; in particular, contamination by quasar 
light seems very unlikely once we have eliminated the ef¬ 
fect of neighboring fibers in the BOSS camera and we have 
tested the absence of a dependence on the quasar measured 
flux. Galactic dust absorption or other systematics on the 
spectral continua also cannot induce the enhanced emission 
that is measured at the inferred Lya wavelength at the red- 
shift of each observed quasar. However, the reader should 
bear in mind this possibility until future independent work 
is able to confirm our measurement. 

We note that the effect of gravitational lensing on 
our results should be zero, even though we use spectra of 
bright galaxies, because our Lya measurements are of sur¬ 
face brightness, which is conserved under lensing. If on the 
other hand we were detecting Lya emitters in the fibers and 
obtaining their luminosity function, this would be subject to 
the well known magnification bias (e.g., Turner 1980). In our 
case, we are computing the cross-correlation function of the 
surface brightness measured from all fibers with quasars, 
and the magnification of Lya sources cannot change the 
amplitude of the cross-correlation. Dust associated with the 
foreground galaxies might reduce the Lya emission coming 
from higher redshift, but this could only further increase the 
inferred brightness of the Lya background. 


6.2 Star-forming galaxies and the photon budget 

We have found in Section [S] that our detected signal of cross¬ 
correlation of Lya surface brightness with quasars implies 
a brightness for the mean Lya photon background given by 
equation dm. This at the same time implies an emissivity of 
Lya radiation of t a = 3.1 x 10 41 (3/fe a ) ergs -1 Mpc -3 . This 
emissivity can be reexpressed in terms of the rate at which 
Lya photons must have been emitted for each baryon in the 
universe at the mean redshift of our observation, z = 2.55. 
Using the comoving number density of baryons rn, = 2.5 x 
10 -7 cm -3 , and an expansion rate at z = 2.55 of H(z = 
2.55) = 261 kms -1 Mpc -1 (using the parameters Qbh 2 = 
0.0221, Ho = 68kms -1 Mpc -1 , and = 0.315, consistent 
with the most recent determinations from Planck in Ade et 
al. 2014), we find the following result: 

i^±im =306 l (22) 

The first, most simple assumption we make is that these 
photons are mostly originating from star formation in galax¬ 
ies. The Lya photons created by recombinations in the HII 
regions produced around massive stars can then be scattered 
out to gaseous halos surrounding galaxies, from which they 
give rise to the background we detect in the quasar-Lya 
emission cross-correlation. As discussed in Section 5, this 
implies a very large star formation rate at z = 2.55. Equa¬ 
tion ED can also be recast in terms of the number of Lya 
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photons emitted for each baryon that forms stars, n a /ns'. 


n a m p (l.l x 10 42 ergs 1 yr) photons 

- — - 1 -w- — loUU —- 

rib hi/oL Mq baryon 


(23) 


Comparing to equation (1221) . we see that this implies that, 
for b a = 3, about 10% of all the baryons in the universe 
would have to turn into stars if the star formation rate 
is maintained over the age of the universe at z = 2.55, 
(2/3 )H~ 1 (z = 2.55) = 2.5 x 10 9 years. The difficulty with 
this very high star formation rate is twofold: estimates of 
the total fraction of baryons in the form of stars at present 
are near 6% (Fukugita & Peebles 2014; Shapley 2011), and 
as described in the previous section, the total star forma¬ 
tion rate at the redshift of our measurement can reach this 
value only for the maximum estimates of dust absorption, 
which would imply that while the UV continuum has to be 
absorbed by factors of ~ 5, the Lya photons would have to 
emerge suffering little dust absorption. 

A first possible solution to the problem of this extremely 
high inferred star formation rate is to modify the Initial 
Mass Function (IMF) of the stellar population that is as¬ 
sumed in deriving the Lya photons emitted per baryon in 
equation (1221) from population synthesis models. If the slope 
of the IMF is flatter in the high mass range of 10 — 100 Mq, 
then stars above ~ 20 Mq, which dominate the production 
of ionizing photons, increase their abundance compared to 
~ 10 Mq stars, which dominate the observed UV contin¬ 
uum. Most of the ionizing photons can be absorbed in HII 
regions, but most of the Lya photons produced by subse¬ 
quent recombinations can escape, and their production rate 
can be increased compared to that inferred from the UV 
continuum observations. If the IMF stays flat down to lower 
masses, that can also greatly reduce the total star formation 
rate that is implied, as well as the stellar mass that is derived 
for the present universe which is measured from the old stel¬ 
lar population dominating the present luminosity of galax¬ 
ies. If this flat IMF occurs particularly in massive galaxies 
with high metallicity, then the UV continuum observed at 
s = 2.55 can be further reduced due to the suppression of 
blue horizontal branch stars, and the luminosity-weighted 
bias factor of the Lya emission can be further increased. 
A top-heavy IMF during the epoch when most stars were 
formed implies a large increase in the production of heavy 
elements, but this may be consistent with observations that 
show relatively high metallicities in the intracluster medium 
and in massive galaxies (Renzini & Andreon 2014). 

For the rest of this section we assume a standard IMF 
with the ratio of Lya photons produced per baryon form¬ 
ing stars in Equation (1221) . We next enumerate additional 
sources that might contribute to this background other than 
star-forming galaxies, estimating if any of these contribu¬ 
tions might be substantial. Finally, we discuss the effects of 
Lya radiative transfer on the quasar-Lya background cross¬ 
correlation we detect. 


6.3 Other sources of Lya emission beyond 
star-forming galaxies 

We shall discuss here four possible contributions to the 
quasar - Lya emission cross-correlation not arising from 
star-forming galaxies clustered around the quasars: (1) Scat¬ 
tering of the quasar Lya broad emission line by the Lya 


forest. (2) Fluorescence of the ionizing radiation from the 
quasar. (3) Scattering of the cosmic UV background by the 
Lya forest in the overdense IGM around quasars. (4) Flu¬ 
orescence of the general cosmic ionizing background by the 
overdense IGM around quasars. (5) Lya cooling radiation 
from radiative dissipation of gas in halos that are correlated 
with quasars. 


6.3.1 Scattering of the quasar Lya broad emission line 

The average observed flux of our sample of BOSS quasars 
within the central ~ 2000 km s -1 of the Lya broad emission 
line is close to f a ~ 1CU 16 ergs -1 cm -2 A . In general, the 
mean fraction of light that is found to be absorbed by the 
Lya forest at z = 2.55 is 1 —F ~ 0.2 (F is the mean transmit¬ 
ted fraction; e.g., Faucher-Giguere et al. 2008). At a char¬ 
acteristic impact parameter of ~ 10/i -1 Mpc (inside which 
our cross-correlation signal is strongest), corresponding to 
an angular separation 9 ~ 500 arcsec, the surface brightness 
of the scattered radiation should be f a (l — F)/(ipn9 2 ), where 
ip is a dimensionless number that depends on the geometry 
of the scattering gas around the quasar and the shape of 
the Lya emission line, and has a value ip — 4 for a uniform 
gas density and a flat emission line profile. This yields a sur¬ 
face brightness ~ 7x 10 -24 ergs -1 cm -2 A arcsec -2 , more 
than two orders of magnitude lower than our measured ex¬ 
cess surface brightness at an impact parameter of 10h -1 Mpc 
from a quasar, as shown in Figure [2] 

Scattered light from the quasar broad Lya emission line 
is therefore a negligible contribution to our detected Lya 
background. This is consistent with our test in Appendix C.3 
showing no dependence of the cross-correlation amplitude on 
the quasar luminosity. 


6.3.2 Fluorescence of the quasar ionizing radiation 

An excess of Lya emission around the quasar may also arise 
from fluorescence of the ionizing radiation from the quasar. 
The ionizing radiation is absorbed in intergalactic absorp¬ 
tion systems, and about half of the energy is reemitted as 
Lya photons when the recombinations that maintain ioniza¬ 
tion equilibrium take place. These Lya photons should be 
predominantly emitted from systems with column densities 
Nhi ~ 10 17 cm -2 , for which the Lyman limit optical depth 
is of order unity. 

The ratio of this fluorescent emission to the scattered 
Lya radiation from the quasar can be estimated as follows. 
The characteristic equivalent width of the Lya emission line 
of quasars is cW a /X a ~ 2 x 10 4 kms -1 (see, e.g., the com¬ 
posite spectrum of BOSS quasars in Figure TOU) . Assuming 
a continuum slope bluewards of the Lya line /„ oc i/ -1 ' 5 
for the average quasar, we find that the ratio of the num¬ 
ber of ionizing photons to Lya photons emitted by a quasar 
is (3/4) 1 ' 5 ( 2/3)\ a /W a ~ 7. Around two thirds of these ab¬ 
sorptions of ionizing photons result in a Lya photon (the rest 
end up producing two-photon emission from the 2s state of 
the hydrogen atom), whereas only ~ 20% of the Lya pho¬ 
tons on the blue half of the quasar emission line are scattered 
in the Lya forest at z = 2.55. The ratio of the total num¬ 
ber of fluorescent to scattered Lya photons from a quasar is 
therefore 7(2/3)/0.2/0.5 ~ 40. 
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To estimate the ratio of the contribution to the excess 
Lya surface brightness near a quasar from the two processes, 
we also need to take into account the different pathlengths 
over which the photons are absorbed or scattered. Whereas 
Ly a photons in the blue half of the quasar emission line will 
be scattered over a pathlength corresponding to the typi¬ 
cal half-width of the Lya emission line, l a ~ 5000 kms -1 , 
the ionizing photons will be absorbed over the longer mean 
free path between Lyman limit absorbers at z = 2.55, which 
is III ~ 30000 kms 1 (Prochaska et al. 2014). The sur¬ 
face brightness produced near the quasar is proportional to 
the total number of Lya photons produced divided by the 
pathlength over which they are emitted. Therefore, we con¬ 
clude that the fluorescent emission from quasars should be 
40/6 ~ 7 times brighter in surface brightness than the scat¬ 
tered light from their Lya broad emission line. 

Even this factor of 7, however, raises the estimated 
surface brightness contribution from fluorescence at an 
impact parameter r = 10fi -1 Mpc to f qa (r) ~ 5 x 
10 -23 erg s -1 cm -2 A arcsec -2 , which is still a factor of 
~ 30 below our measured value at this impact parameter 
from Figure [5] 

We note here that this Lya fluorescent emission from 
the quasar would not be of uniform intensity, but would pre¬ 
dominantly arise from Lyman limit system absorbers in the 
quasa r vicinity (see, e.g., the simulations in lKollmeier et akl 
l20ldb This does not matter for our measurement, which is 
sensitive only to the mean surface brightness. In any case, 
our discussion suggests that the main difficulty for detecting 
this component of fluorescent emission caused by individual 
quasars probably lies in distinguishing it from the Lya emis¬ 
sion scattered in gas halos surrounding star-forming galaxies 
that are clustered with the quasar, which is what we believe 
dominates the production of the Lya photons in our signal. 

We also point out here that the quasar fluorescence con¬ 
tribution is affected by the anisotropy of the quasar emission 
and the time-delay between our observation of the quasar 
and that of the fluorescent light. Typically, if quasars are 
highly anisotropic and variable, the quasar emission in a di¬ 
rection away from the line of sight, or at ~ 10 7 years ago 
corresponding to the time when the quasar was illuminating 
an absorber at an impact parameter r ~ 10/i -1 Mpc, would 
be systematically lower than the one we observe at present 
because of the flux-limited selection of quasar catalogues. 
These effects are likely to be important in any detailed mod¬ 
elling of the fluorescent emission due to the quasar. 


6.3.3 Fluorescence of the cosmic ionizing background 

Fluorescence from the mean cosmic ionizing background can 
also contribute to the quasar-Lya cross-correlation we mea¬ 
sure. In this case, if we assume the intensity of ionizing 
photons from distant sources is uniform around the quasar, 
the effect on the cross-correlation would arise only from the 
overdensity of Lyman limit absorbers near the quasar, be¬ 
cause a uniform Lya brightness does not contribute to our 
detected signal. We consider a value of the photoionization 
rate at z = 2.5 of T = 10 -12 s -1 , which corresponds to a 
proper intensity per unit wavelength of the ionizing back¬ 
ground of i\ ~ 3 x 10 -2 ° erg s -1 cm -2 A arcsec -2 at the ion¬ 
ization edge A = 912 A (e.g., Faucher-Giguere et al. 2008). 


This ionizing background is being absorbed over a mean free 
path A i ~ cH~ 1 /10 at z = 2.5 (Prochaska et al. 2014). We 
assume that the ionizing background has a spectral slope —/3 
in frequency (i.e., i\ oc A^ -2 below A = 912 A). Each ioniz¬ 
ing photon produces a Lya photon with probability close to 
2/3, so the intensity of the Lya background that is produced 
is 


_ 2 / 3\ 2 i x cH~ 
3 \4/ /3 A 


The factor (3/4) 2 arises from the ratio of the Lya to the 
Lyman limit wavelength (one factor for the energy of each 
photon, and one for the fact that the intensity is per unit 
of wavelength). This approximates A; as being indepen¬ 
dent of frequency, which is valid for optically thick ab¬ 
sorbing systems (for optically thin systems, /3 in the de¬ 
nominator needs to be replaced by /3 + 3, and the re¬ 
ality should be intermediate). Using /3 = 2, we obtain 
i a ~ 5x 10 -2 ° ergs -1 cm -2 A 1 arcsec -2 , at z = 2.55. When 
observed at the present time, the intensity of this Lya back¬ 
ground is reduced to 

iao = *«/(! + z ) 5 — 10 -22 ergs -1 cm -2 A 1 arcsec -2 . (25) 


This mean intensity now needs to be multiplied by the cross¬ 
correlation function of Lyman limit absorbers and quasars, 
to obtain the contribution to our measured cross-correlation. 
The value of the mass autocorrelation at impact parameter 
r = 10h -1 Mpc and z = 2.55 is ~ 0.05, which needs to be 
multiplied by the bias factor of quasars and the bias factor 
of Lyman limit absorbers. The bias factor for the BOSS 
quasars is known, b q ~ 3.5, and that of the Lyman limit 
systems is unlikely to be larger than the value for DLAs, 
bn — 2 (Font-Ribera et al. 2012), implying that fp ~ 1.3 
and b q bLLfp£,m{r = 10ft -1 Mpc) is ~ 0.45. This yields a 
value of the contribution to the quasar-Lya cross-correlation 
of ~ 4.5 x 10 -23 ergs -1 cm -2 A 1 arcsec -2 , a factor of ~ 30 
below our measured value from Figure [2] 

We note that this estimate is consistent with a cruder 
calculation which uses the ionizing background observation- 
ally inferred by Fontanot et al. (2014) from the comoving 
space density of quasars and star forming galaxies. After 
exploring the likely parameter space of limiting magnitudes 
and escape fractions, Fontanot et al. (2014) find that a 
central value for the ionizing background comoving emis- 
sivity is about 3 x 10 50 photons s -1 Mpc -3 at z ~ 2.55. If 
the emissivity of the ionizing photons that are converted 
to Lya photons is at a similar level, with a conversion 
efficiency of 2/3, the fluorescent Lya emissivity is then 
2 x 10 50 photonss -1 Mpc -3 . This is 3.2 x 10 39 ergs -1 Mpc -3 , 
~ 100 times lower than the value inferred from our results. 

We therefore conclude that fluorescence from the cosmic 
ionizing background also cannot be responsible for the large 
Lya brightness that we detect correlated with quasars. 


6.3.4 Scattered Lya radiation from the radiation 
background 

Just as scattering from the quasar by the Lya forest can in 
principle produce some contribution to this Lya light around 
quasars, one may think that the general ultraviolet contin¬ 
uum background from distant galaxies and quasars could 
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also give rise to an excess of Ly a photons near quasars be¬ 
cause of the overdensity of the Lya forest that scatters this 
background radiation. However, this effect actually cancels 
out for our observation. High redshift galaxies that are be¬ 
hind the quasar should show the Lya forest absorption, re¬ 
ducing their ultraviolet flux, and this Lya absorption will be 
enhanced because of the overdensity surrounding the quasar. 
The background sources (which are of course too faint to 
be detected individually, but always contribute to our total 
background) are therefore fainter near quasars compared to 
any random fields. This is exactly compensated by the scat¬ 
tered Lya background from these same sources. For this 
reason, this contribution to the Lya background could only 
be detected if the individual sources behind the quasar were 
individually detected and subtracted out, before evaluating 
the Lya background intensity. 

In addition to Lya photons scattered by the Lya forest, 
the background would also have a contribution from back¬ 
ground photons reaching the Lyy wavelength when passing 
near the quasar and being downscattered to Lya . This con¬ 
tribution is also very small, and is also nearly cancelled un¬ 
less the population of ultraviolet sources creating the back¬ 
ground evolves very fast over the redshift interval corre¬ 
sponding to the ratio of the Lyy to the Lya wavelength, 
(note that the Ly/3 forest does not contribute to this diffuse 
emission by intergalactic gas because Ly/3 absorptions can 
only end in a 2-photon emission from the 2s atomic state of 
hydrogen.) 


6.3.5 Low surface brightness Lya halos around galaxies 

We have described in Section [5] how Lya emission from 
galaxies could account for our measured result. Due to the 
magnitude of our signal, this Lya emission does not appear 
to originate in a straightforward way from Lya emitters seen 
in narrow band surveys at these redshift. 

The Lya luminosity from observed Lya emitters is 
usually defined by pixels above certain surface bright- 
ness thresho ld or measured within an aperture (e.g., 2”; 
lOuchi et al.l 120081 1. In any case, the surface brightness 
threshold is well above the sky noise and higher than the 
level associated with our measurement in this paper. An ex¬ 
tended Lya emitting halo below sky noise level, resulting 
from Lya photons scattered by neutral gas in the circum- 
galactic and intergalactic media and clustered unresolved 
Lya sources, is predicted to exist around a star-forming 
galaxy based on radiative transfer modeling (I Zheng et all 
l2011bl l. Such diffu se Lya emitting halos are detected from 
stacking analysis (istcidel ct al. 1 120 111 : lMatsuda~ et al. 1 [20121; 
iMomose et alJl2014h . 

We use the stacked Lya surfa ce brightness profiles and 
the fits to the diffuse Lya halos in IMomose et all d2014l l to 
estimate the contribution of these halos to Lya emission in 
our detection. The luminosity inside the aperture of radius 
2" roughly corresponds to the Lya luminosity from the Lya 
emitter survey. We find that at both z = 2.2 and z = 3.1, 
the diffuse Lya emission outside of 2 " is about one third of 
the luminosity inside the aperture. Therefore, the observed 
diffuse Lya halos, regardless of their origin, may only con¬ 
tribute an additional one third to the Lya emission from 
Lya emitter surveys. 

This implies that our hypothesized extended emission 


from scattering halos that may account for our measure¬ 
ment would have to arise in many more galaxies than those 
detected in Lya -emission surveys. 


6.3.6 Cooling radiation 


Cooling radiation from gas in galactic halos can produce 
Lya emission. A rough estimate of the cooling radiation 
can be made if we assume that cooling and star forma¬ 
tion reach a steady state on average in a galaxy (i.e., 1 Mq 
of gas cools per year to feed 1 Mg yr -1 star formation). 
For gas initially at virial temperature T dissipating the en¬ 
ergy throug h cooling (as suggested by cosmo lo gical simula¬ 
tions; e.g., Fardal et al.ll200ll ; iFaucher-Gieuere et al.l l20ld : 
iGoerdt et al.ll201C)l '). we can estimate the corresponding cool¬ 
ing luminosity as follows: 

Let us assume that a fraction f c of all baryons in the 
Universe fall into halos of virial temperature T, and they 
dissipate all their energy by emitting Lya photons. Initially 
the baryons fall into the halo and are shock heated to tem¬ 
perature T at a radius r v ~ at/6 in the halo, where t is the 
age of the Universe, and in the end they have to reach a 
radius r g by dissipating their energy through Lya emission, 
where r g is the half-radius of the galaxy. 

The circular velocity v c = V2a is assumed to be flat, 
independent of radius. Then, the potential difference from 
r v to r g is </> = v 2 log(r„/r c ). The energy to be dissipated 
per baryon is therefore m p (j). The relation between v 2 and 
the temperature T is a 2 = kT //x, where /r = 0.6m p for the 
fully ionized mixture of hydrogen and helium from Big Bang 
nucleosynthesis. So, the total energy dissipated per baryon 
in the universe is 

t b = fcmpcp = ^LkTlog(r v /r c ), (26) 

0.6 

and the number of Lya photons emitted per baryon is 

kT 


e a = ep/{ 10.2eV) = 3.3/ c 


1.2 x 10 5 K 


log(r„/r c ). (27) 


Using T = 3x 10 6 K and r„/r c = 20 we get e a = 240 f c . With 
plausible values of f c ~ 0.2, this could amount to nearly 15% 
of the emission we observed, and more if the bias of this Lya 
emission is high. 

Cooling radiation is therefore the alternative source of 
Lya emission which comes closest to explaining our results, 
being plausibly less than an order of magnitude below our 
measurements. Cooling radiation could contribute in a sub¬ 
stantial way to the observed Lya emission. 


6.4 Radiative transfer effect 

The above estimates show that the Lya emission relevant 
to our clustering measurements is likely to be dominated 
by contributions from star-forming galaxies. Regardless of 
the origin, as long as Lya photons are scattered by neu¬ 
tral hydrogen, we expect them to be affected by a radiative 
transfer effect. The quasar-Lya cross-correlation function we 
measure suggests that this effect is detected, shown as elon¬ 
gated contours along the line of sight on scales as large as 
~ 20/i _1 Mpc. This feature cannot solely originate from the 
dispersion of the relative peculiar veclocity between quasars 
and galaxies (including redshift errors), but is consistent 
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with the predicte d main radiative effe ct on the clustering 
of Ly a emitters in IZheng et al.l d201 lal i . _ 

Th e radiative transfer effect predicted in IZheng et al.l 
d2011ah is a result of the anisotropic emission of scattered 
Lya photons from the anisotropic distribution of neutral 
gas density and velocity (hence anisotropic distribution of 
Lya scattering optical depth). In particular, Lya photons 
preferentially escape along the direction for which the neu¬ 
tral gas has the largest peculiar velocity gradient. Therefore, 
the Lya emission from less overdense regions along the line- 
of-sight direction would appear enhanced, suppresing the 
line-of-sight fluctuation. This dependece on the line-of-sight 
peculiar velocity gradient is analogous to that in the Kaiser 
effect but of opposite sign, leading to the elongated corre¬ 
lation contours. For all the possible origins of Lya emission 
discussed above, this radiative transfer effect should be at 
work as long as Lya photons interact with neutral gas on 
large scales. 

Our measurement can be used to constrain the param¬ 
eters relevant to the radiati ve transfer ef f ect. Fo llowing the 
simple model presented in IZheng et al.l (j2011al h the real- 
space overdensity 5 a (the Lya surface brightness fluctua¬ 
tions here) can be related to the matter linear overdensty 
field 5 and peculiar velocity field as 


1 + 3 a oc (1 -f- &lA) 


, , A , 1 dv z 

1 -I- aid + a 2 —— 

Ha oz 


(28) 


where bL is the Lya luminosity weighted bias of the un¬ 
derlying galaxy population and v z the line-of-sight peculiar 
velocity. The coefficient ai represents a combined effect of 
the dependence of the Lya radiative transfer (i.e., the Lya 
escape emission) on the density and the transverse peculiar 
velocity gradient, and 02 specifies the impact of th e line- 
of-sight peculiar velocity gradient (see IZheng et al.1 l2011al . 
in particular their Appendix A). Both coefficients are ex¬ 
pected to be positive. In redshift space, we also need to add 
the Kaiser effect, and in terms of the Fourier component of 
the Lya surface brightness fluctuations, we have 


5a,k = (b L + ai)[l + /3 a /r 2 ]dk- 
The j3 a parameter (as constrained in § 3.4) is 

n m (z = 2.55) 0 ' 6 


/3c = (1 - a 2 ) ; 


bL + ai 


(29) 


(30) 


The factor “1” comes from the Kaiser effect. The bias factor 
b a in Equation's 6i + ai. With /3 a = — 0.76±0.36, we have 
a 2 = 1 + ( 2.35 ± l.ll)(feq/3) ( N.B. this is the C v coefficient 
defined in lWvithe fe Diikstralfioilh . That is, the radiative 
transfer effect (indicated by a positive a 2 ) shows up or has 
been detected at a level of ~ 3.0cr for the fiducial value of 
b a = 3. 

Overall, the quasar-Lya cross-correlation provides ten¬ 
tative evidence to a new clustering effect caused by Lya 
radiative transfer. A better measurement with a larger data 
set and a more detailed modeling will help understand this 
effect and use it to constrain the neutral gas distribution. 


6.5 Escape fraction and detected fraction 

Assuming now that our fiducial value of the star forma¬ 
tion rate density is correctly inferred from the quasar-Lya 
cross-correlation, we conclude that it is consistent with the 


whole dust corrected star formation rate in lBouwens et all 
(l2O10ll . At face value, this indicates that dust has little ef¬ 
fect in reducing the Lya emission, i.e., almost 100% of the 
Lya photons produced from star formation escape. Clumpy 
dust clouds (INeufeldl 199ll l and anisotropic Lya escape 
(IZheng fe Wallaceli2014 ) may be possible mechanisms for ex¬ 
plaining the high escape fraction of Lya photons, together 
with the much lower one for continuum UV photons. 

We emphasize that the escape fraction is not the same 
as the fraction that is detected in Lya emitter surveys. 
The latter, the detected fraction, comes only from the 
central, high surface brightness part of the Lya emission 
from galaxies, and not from the extended Lya halos of low 
surface brightness dZheng et al.l 2011bl; Steidel et al.l 1201 ll ; 
iMatsuda et al.l 120121 ; iMomose et al] l2014T l. The detected 
fraction for Lya emission at z = 2 to 3, inferred from com¬ 
paring the Lya luminosit y density and H a or Ha luminosity 
densi ty, is about 5% fe.g. lHaves et al.ll201ll ; Iciardullo et al.l 
I2014T) . This fraction is consistent with the ratio between the 
star formation rate density inferred from Lya emitters and 
that from quasar-Lya cross-correlation in this work (see the 
blue and red points in Figure [LH) . 


7 SUMMARY AND CONCLUSIONS 
7.1 Summary 

We have carried out a cross-correlation analysis of residual 
light in SDSS/BOSS galaxy spectra and SDSS/BOSS 
quasars at redshifts from 2 = 2.0 — 3.5. We have used the 
Lya emission line (which is at wavelengths A = 3647 — 5471 
A at these redshifts) to trace structure in the cross¬ 
correlation. Our main findings are as follows: 

(1) We measure large-scale structure in the quasar-Lya 
emission cross-correlation at a mean redshift 2 = 2.55 at 
the 8 a level. The cross-correlation function shape is consis¬ 
tent with the linear ACDM correlation function. 

(2) Looking at the clustering as a function of separation 
across and along the line of sight we see evidence at the 
3.0a level for distortions of clustering of the form expected 
to be caused by radiative transfer effects. 

(3) We detect clustering independently in 4 subsamples 
at different redshifts, finding that the shape of the cross¬ 
correlation function is consistent with the fiducial sample. 
The amplitude of the cross-correlation increases by a factor 
of 3 between 2 = 3.5 and 2 = 2.0, although this detection of 
evolution is marginal, being consistent with no evolution at 
the 2.0(J level. 

(4) Although the spectra are too shallow to allow making a 
good catalogue of emission lines, we are able to weakly con¬ 
strain the contribution of emission lines to our signal sta¬ 
tistically by fitting lines, subtracting them and remeasuring 
the quasar-Lya cross-correlation function. We find that lines 
with luminosities (measured in our 1 arcsec radius aperture) 
of Z/Lyc > 8 x 10 41 erg s _1 may contribute 131*3% °f th e 
quasar-Lya cross-correlation amplitude at the relevant red¬ 
shifts, but this contribution is still consistent with zero. 

(5) In one of our sample tests, we measure the cross¬ 
correlation to be independent of quasar luminosity. This 
is evidence that the large-scale clustering of Lya surface 
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brightness we measure arises mostly from Lya emission as¬ 
sociated with star formation, and not from any systematic 
error associated with the quasar light. 

(6) We estimate the plausible contribution to the quasar- 
Lya surface brightness we measure from a variety of physical 
processes alternative to star formation galaxies that follow 
the same large-scale structure as quasars, such as fluores¬ 
cence or scattering of the quasar emission or the ionizing 
background, and we find these contributions to be almost 
all negligible. Only cooling radiation may contribute in a 
substantial fashion (perhaps at the 15% level) to the Lya 
surface brightness. 

(7) Using measurements of clustering from SDSS/BOSS 
quasars, we convert the measured amplitude of the cross¬ 
correlation function b q b a fp{fj,) to the product of mean Lya 
sky brightness at z = 2.55 and linear bias factor of Lya 
emission, finding (/r a )(6 a /4) = (3.9 ± 0.9) x 10 -21 erg s -1 
cnR 2 A -1 arcsec -2 . Assuming that this Lya surface bright¬ 
ness is due to star formation (see points (5) and (6) above), 
we convert our measured value to the mean star formation 
rate density in the Universe at redshift z = 2.55, finding 
Psfr = (0.28 ± 0.07)(3/6 a ) JV/Qyr -1 Mpc -3 . This is consis¬ 
tent with dust-corrected UV continuum based estimates of 
star formation, but more than an order of magnitude higher 
than previous estimates of the SFRD from surveys of indi¬ 
vidual Lya emitters. 

7.2 Implications 

We conclude that the high intensity of the Lya background 
at z — 2.5 that is derived from our measurement can only 
be reasonably produced by Lya -emitting galaxies that are 
clustered with quasars. If our measurement is confirmed, 
the consequences for our understanding of galaxy forma¬ 
tion and evolution are dramatic. The Lya emission directly 
observed in Lya emitting galaxies so far at these redshifts 
contributes only 0.01 Mq yr -1 Mpc -3 to the mean star for¬ 
mation rate density. Extended halos that have been seen 
around these Lya -emitting galaxies detected in surveys can 
only contribute an additional ~ 30% to this value. We have 
detected 21 — 35 (±lcr range) times more Lya photons than 
the Lya emitter surveys (with the uncertainty due to the fac¬ 
tor (3 /b a ), and we have argued that most of these photons 
also arise from star formation. This amount of star forma¬ 
tion represents most of the massive stars that are estimated 
to have formed in the universe and to have generated the 
present heavy elements. 

To contribute to our measurement, these Lya photons 
cannot be absorbed by dust before escaping the galaxies. 
The question that remains then is how this Lya radiation 
could have been missed by previous observations. Putting 
forward and testing a detailed scenario is beyond the scope 
of this paper, but we speculate that all star forming galaxies 
at these redshifts, even if they do not have any detectable 
Lya emission line in their central parts, are surrounded by 
low surface brightness halos that nevertheless have a high 
total integrated Lya luminosity. These low surface bright¬ 
ness halos could allow the bulk of Lya photons to be below 
the detectable levels in narrow band Lya emitter surveys. 
As much of the star formation in the Universe at redshifts 
2 = 2 — 4 occurs in massive galaxies, the implication is that 
a large fraction of the Lya emission we detect is from giant 


low surface brightness halos around massive, bright, star¬ 
forming galaxies, which absorb most of their continuum pho¬ 
tons to reradiate the energy in the infrared, and yet let their 
Lya photons escape. 

7.3 The future 

We note that the sky area covered by the million 
SDSS/BOSS (for our purposes randomly placed) fibers in 
the current study is ~ 3 x 10 s square arc seconds. This 
is approximately 1/200, 000 of the entire sky area, showing 
that Lya intensity mapping holds an enormous promise as a 
probe of structure in the Universe. In addition to the quasar- 
Lya emission cross-correlation employed in the present pa¬ 
per, one can imagine carrying out Lya forest-Lya emission 
cross-correlations and Lya emission autocorrelation mea¬ 
surements. Correlations of Ly/3 absorption and Lya emis¬ 
sion, and vice-versa would avoid common mode systematics 
in the fluxing of spectrographs and may reduce the possi¬ 
bility of related error. As the Lya signal is distributed on 
large-scales, a way to efficiently carry out intensity map¬ 
ping surveys (for example for baryonic acoustic oscillation 
experiments) may be to use integral field spectroscopy with 
relatively low angular (~ 10 arcsec) resolution on a wide- 
field (~ few deg.) telescope. If bright point sources could 
be masked such an instrument could in principle capture a 
dataset containing 5 orders of magnitude more information 
on large-scale clustering in the Universe than our present 
study. 
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APPENDIX A: STRAY LIGHT 
CONTAMINATION 

As mentioned in Section [3] a potentially very important sys¬ 
tematic is the contamination of the galaxy spectra by quasar 
light. This is particularly relevant for our cross-correlation 
measurement because we are searching for light from sources 
that are clustered with quasars, and light from the quasar 
itself could mimic this if not treated carefully. The issue 
arises because the 1000 fibers that orginate from the plate 
in the telescope focal plane are fed into (each red and blue) 
spectrograph and dispersed onto a single CCD. 1000 spectra 
are therefore extracted from a 4kx4K chip and some light 
from neighbouring fibers can end up in CCD columns that 
are centered on other fibers. 


A.l: Measured Quasar-Lyman-a surface brightness 
correlation 

We can test for this contaminating light by use of two facts 
about the observational setup. The fibers are arranged by 
fiber number (from 1-1000) on the CCD, so that for each 
spectrum we can measure light from a certain number of 
fibers away. Quasars and galaxies that we use in our cross¬ 
correlation will also often be measured on different plates, 
so that there is no possibility for this contamination. We can 
therefore also test our result by restricting the calculation 
of the quasar-Lya cross-correlation to pairs of quasars and 
galaxies lying on different plates. 

In the first test, we have computed the quasar-Lya 
cross-correlation of Equation [l] restricting ourselves to only 
quasar-galaxy spectrum pairs separated by specific values 
of Aflber, the difference in fiber number of the two spectra. 
Spectra with Afiber = 1, are adjacent on the CCD, for exam¬ 
ple, and have the highest potential for cross-contamination 
of light. We reduce the quasar-Lya cross-correlation for each 
value of Afiber by averaging the £ q a (r) results over a range 
from r = 5 — 50h -1 Mpc. Our conclusions about stray light 
contamination are not dependent on the range picked. 

The results are shown in Figure [T5l where we show £ qa 
(r = 5 — 50h _1 Mpc) against Afiber- In the plot the value 
of £ qa (r — 5 — 50/i _1 Mpc) for our fiducial sample (Sec¬ 
tion 13.111 , which uses all pairs with Afiber > 6 is shown as 
a horizontal line. We can see that for Afiber = 1, the mean 
surface brightness inferred from the cross-correlation is over 
20 times the fiducial value. It remains significantly higher for 
all Afiber < 4. This is an indication that even when quasar 
and galaxy spectra are separated by 3 other spectra that 
light from the quasar is able to leak into the galaxy spec¬ 
trum and contaminate it. Of course the particular region 
of the spectrum we are looking at (close to the Lya emis¬ 
sion line at the redshift of the quasar) is the one in which 
the quasar is extremely bright, and one would not expect 
it to contaminate other parts of galaxy spectrum as much. 
Nevertheless, for our project, this is the important region of 
the spectrum, and we therefore must apply a cut on Afiber- 
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Figure 15. A test of stray light from quasars in nearby fibers 
contaminating the galaxy spectra. We show the quasar-Lya emis¬ 
sion surface brightness cross-correlation averaged over quasar- 
pixel pair separations of r = 5 — 50/i — 1 Mpc plotted against Afiber? 
the difference in fiber number between the quasar and the galaxy 
spectra. A value of Afiber = 1 means that the quasar and galaxy 
were in adjacent fibers and their spectra where dispersed next 
to each other in the CCD. The error bars are jackknife errors 
computed using 100 subsamples of the data in each case. The 
horizontal solid line is the quasar-Lya emission surface bright¬ 
ness cross-correlation averaged over quasar-pixel pair separations 
of t* — 5 — 50/i — 1 Mpc for our fiducial computation (see Section 
13.ID , which uses information from all quasar-pixel pairs 6 fibers 
apart or greater. The dotted line (which lies close to the solid line) 
is the equivalent result but only using quasar-pixel pairs which 
are on different plates. 

Based on Figure H5l we have chosen that Afiber > 6 in our 
analysis. 

A related test is to compute £ qa (r = 5 — 50h x Mpc) 
for quasars and galaxies that are on different plates (and 
therefore not dispersed at the same time onto the CCD). 
We show the results as a dashed line in Figure [15] which is 
indistinguishable from the results from our fiducial analysis. 
This is good evidence that our cut Afiber > 6 is sufficient to 
eliminate stray quasar light. 

Yet another related test is to eliminate all galaxy fibers 
which have a quasar within a certain number of fibers. This 
is different from eliminating quasar-pixel pairs on a case-by- 
case basis because it will also eliminate any potential con¬ 
tamination which could come from quasars being clustered 
in space with other quasars. In the fiducial case, the con¬ 
taminating quasars will have been eliminated directly, but 
those which are clustered with them could still contaminate 
the neigbouring galaxy spectra. One good reason to believe 
that this is not occuring to a detectable degree is that any 
contamination of this type would be a convolution of the 
redshift space autocorrelation function of quasars and the 
contaminating surface brightness from the quasar lya line, 
which is highly asymmetric and extremely elongated along 
the line of sight (see below). This does not appear to describe 
the measured signal (e.g., Figure U). 

Nevertheless, we have carried out the test, which elim- 



Figure 16. The mean surface brightness from all DR10 quasar 
spectra as a function of wavelength. We shift all spectra to their 
rest wavelength and stack them with equal weight to make the 
curve. 

inates 50% of the galaxy fibers from the data sample. We 
find an amplitude b q b a fp(^) = 1.98tggg x 10 -20 erg s -1 
cm -2 A -1 arcsec -2 . and shape = 0.694g'3g. These are 
consistent at about the 2 a with the measurement from the 
fiducial sample. We note that the detection level of the signal 
is only about 3 a overall, compared to ~ 9cr for the fiducial 
measurement. This can be explained by the fact that elimi¬ 
nating all galaxies which have a quasar within 5 fibers will 
disproportionately affect the number of close quasar-pixel 
pairs. By directly counting, we find the number of quasar- 
pixel pairs with separations below 40 h -1 Mpc has fallen by 
a factor of 3.5 rather than the factor of 2 expected if pairs 
were drawn uniformly from all fiber separations. 

We have also carried out another, similar test, which 
eliminates a smaller fraction of the dataset, but which should 
have the same effect. For this test we remove from our list 
of galaxy pixels all pixels which have more than 1 quasar 
within r = 50/i -1 Mpc. In this way, we eliminate all po¬ 
tential cross-contamination from quasars clustered with the 
target quasars in the cross-correlation, but at a much re¬ 
duced cost (doing this only eliminates 0.3 % of the galaxy 
spectrum pixels). After fitting the cross-correlation of this 
sample, we find an amplitude b q b a fp(ij) = 3.041q 45 x 10 -20 
erg s' 1 cm -2 A -1 arcsec -2 . and shape = 0.331 q'4q. This 
is consistent with the fiducial result within the error bars. It 
is slightly lower (by less than 1 a), which should not be sur¬ 
prising, as we are presumably preferentially removing some 
pixels in overdense regions. Overall these two test results 
are a good sign that significant cross-contamination from 
quasars clustered with the target quasar is not ocurring. 

A.2 Modelling the stray light contamination 

If the excess surface brightness seen above in galaxy spectra 
which are close on the CCD to quasar spectra is indeed 
due to cross-talk from quasar light, we would expect the 
contamination to have a quasar-like spectrum. To examine 


© 0000 RAS, MNRAS 000, 000-000 











Lyman-a emission intensity 25 



Figure 17. Top panel: The stacked quasar surface brightness 
from Figure ITT] multiplied by 2.0 X 10 3 . The stack is centered on 
the redshift of the Lya line at redshift z = 2.55 and the fiducial 
cosmology has been used to convert A to comoving /i -1 Mpc. The 
stack is plotted as a function of r|| and is independent of r_\ . 
Bottom panel: The measured quasar-Lya emission correlation for 
quasar-galaxy pixel pairs separated by 1 fiber (Agber = 1) as 
a function of ry and r \ . In both panels a Gaussian filter with 
er = Ah ~ 1 Mpc was used to smooth the image (as in FignnTTli. 


this, we first make a stacked spectrum of the DR10 quasar 
sample, by averaging all the spectra together in the quasar 
rest frame with unit weight. This is shown in Figure fl6l We 
can see from that figure that the quasar Lya line is extremely 
broad, with a FWHM of ~ 50 A in the rest frame. There is 
also the noticeable emission feature due to NV on the red 
wing of the line. 

We next take this stacked spectrum, and convert the 
wavelength units into comoving h _1 Mpc at z = 2.55, the 
mean redshift of our measurements. If the quasar light is 
truly a contaminant, we expect the strength of the contam¬ 


ination to depend on the difference between fiber numbers 
(Afiber, as defined above) and not on the actual physical 
separation between quasar and pixel across the line of sight 
(r±). Plotting the quasar-Lya cross-correlation for partic¬ 
ular Aflber values as a function of r± is therefore a useful 
check that light contamination is occuring as we believe. 
The prediction for the contamination, taken directly from 
the quasar spectrum is shown in the top panel of Figure El 
In that Figure we have scaled the overall amplitude by a 
factor of 2 x 10 -3 (see below). We can see that the large 
width of the Lya line, and its asymmetry due to the NV 
feature are both apparent. 

The data with the strongest contamination should be 
from galaxy pixel-quasar pairs which are 1 fiber apart 
(Afiber=l). We therefore plot the quasar-Lya correlation for 
just those pixel-quasar pairs in the bottom panel of Figure 
El It is immediately apparent that there is a stripe across 
the middle of the plot corresponding to the contaminat¬ 
ing quasar Lya emission line. The contamination is clearly 
asymmetric, extending further to positive values of ry than 
negative, and overall is visually consistent with the predic¬ 
tion based on the stacked quasar spectrum (top panel of 
Figure fTTl) . Because the quasar-pixel pairs that are exactly 
1 fiber apart only comprise a very small fraction of the data, 
we expect there to be lots of noise in the plot, particularly 
for large values of r±. This latter is because close pairs of 
quasars and galaxies on the sky are more likely to be close 
in Afiber- There is neverthless enough range that we can ex¬ 
amine by eye whether the contamination depends on rj_, 
with the answer appearing to be no. We examine this more 
quantitively below. 

When comparing the top and bottom panels of Figure 
El we note that the amplitude of the stacked quasar spec¬ 
trum is scaled by a factor of 2 x 10 3 , which is the level 
required to match the observations in a \ 2 fit (see below). 
This means that 0.2 % of the quasar light is scattered into 
neighboring spectra (i.e. spectra with Agber = 1). 

If there is a signal that is actually coming from quasar- 
Lya emission correlations in the Universe, one would expect 
that to be superimposed on top of any contamination from 
stray light. We have therefore carried out a simple joint fit 
of £ qa with a sum of the fiducial theoretical model from 
Figure U (right panel) scaled by an amplitude factor ocdm 
and the stacked quasar contamination scaled by a factor 
flcontam- The fit is therefore as follows: 

£fit (r_L, 7*11 ) — flCDM^q ol,CDm{J" _L, F|| ) T Gtcontam^stack (r_L, 7|| ), 

(31) 

where £ s tack(rj_, ry) (multiplied by 2 x 10 3 ) is shown in 
Figure El (top panel). We separate the observational £ qct 
results by Agber and then carrying out a y 2 fit of the form 
given by Equation [31] to determine the best amplitude pa¬ 
rameters flcontam and bcdm as well as the confidence limits 
on those parameters. We carry out the fit for quasar-pixel 
pair separations r < 50/i _1 Mpc, although our conclusions 
are insensitive to this value. 

The results are plotted in Figure El For the Agber = 1 
dataset, we find a total ;y 2 f° r the best fit of 1526 for 988 
datapoints, which shows that the light contamination which 
dominates the fit is fairly well modelled by the quasar stack. 
It is perhaps not surprising the fit to the contamination is 
not perfect, as the contamination model is extremely simple. 
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Figure 19. The measured £ qQ from BOSS data averaged between rn = ±o0/? 1 Mpc and plotted as function of r ±. We show results 
for different values of Afib er as indicated in each panel. The points with error bars show the observational results. The prediction of the 
theoretical model from Equation 11 is shown as a dotted line. A dashed line indicates scattered light contamination of the sort discussed 
in Section 3.2 which is independent of r± and the solid line is the sum of the two components. 


We have plotted the 1,2 and 3 a confidence contours on 
the parameters ucdm and a CO ntam in Figure [T8] The central 
value of a CO ntam is 1.75 x 10 -3 . The 1 <r confidence interval 
on acDM is consistent with unity, and also with zero. The 
results from Afiber = 1 are therefore consistent with the sum 
of significant contamination from quasar light and a quasar- 
Lya correlation at the level predicted by the CDM model 
fit to the fiducial dataset. 

We have carried out the same fitting to datasets with 
different restricted values of Afiber in the other panels of 
Figure [T8l We can see that for cases with Afiber < 4 there is 
a significant detection of a contamination contribution. The 
value of acontam is lower for Afib er 2 —4 than for Afiber = 1 at 
the factor of 10 — 4 level. In all cases the measurements are 
also consistent with an additional quasar-Lya correlation at 
the level of ucdm = 1- For Afiber = 5 — 10 and Afiber > 6, 
the contamination is not detectable. 

The results of the fitting shown in Figure [18] are a good 
sign that the signal and contamination are behaving in a 
particular way which can be accounted for by excising pairs 
of pixels from close fibers. It is useful however to examine the 
results for individual datapoints in more detail. Of partic¬ 


ular interest is the measured quasar-Lya surface brightness 
as a function of r±. As mentioned above, we expect contam¬ 
ination at the CCD level for a given fiber separation to be 
independent of r_L. We have taken the measured £ qa (?~x, Ty) 
values for different datasets limited by Afiber and averaged 
them between rii = ±50/i _1 Mpc .We plot this average, (£ qa ) 
as a function of ri in Figure [19] In each panel we also plot a 
dashed horizonal line showing a level of contamination that 
is independent of r±, a dotted line showing the level of the 
signal from quasar-Lya clustering (we plot (£ qa )(rx) com¬ 
puted from the model shown in Figur^U right panel), and a 
solid line which is the sum of the two. 

We can see that in the Afiber = 1 panel there is the 
strong contamination signal which is consistent with being 
independent of r±. The small relative contribution of the 
actual signal makes little difference in this panel. In the 
Afiber = 2 and Afiber = 3 panels the level of contamination 
is lower (note that the y-axes are different in the different 
panels), but it still dominates over the expected signal, with 
again no sign of a dependence on r±. For the fiducial panel 
(bottom right), we can see that the measured (£ qa )(rx does 
have significant dependence on r±, and this has a similar 
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Figure 18. A joint fit of quasar scattered light contamination 
and cosmological model for £ qQ: (r) for different values of Aflber- 
We show 1,2 and 3 a confidence limits on the joint parameters 
ttcontam and ucdm from Equation [31] 

form and amplitude to the best fit model to the data in Sec¬ 
tion 3.2. There is no sign of a contaminating (independent 
of rx) component to this measurement. 

Having carried out these tests on contamination from 
the Lya emission line from nearby quasars, it is pretty clear 
that the signal we are seeing in our fiducial dataset cannot 
be caused by straightforward scattered light coming from 
nearby fibers. There remains the possibility however that 
there is an additional contaminating component from some 
other mechanism that has a dependence on r±. This is dif¬ 
ficult to imagine, but one can construct tests for this based 
on other spectral features than the Lya line. In the next 
two subsections we do this, first centered on the quasar CIV 
emission line and then using stars and the Ha line. 

A.3 Tests with CIV 

As a test of scattered light contamination, we carry out a 
different correlation between galaxy spectrum pixels and 
quasar positions. The only difference from our standard 
analysis (Section 3) is that we compute the cross-correlation 
at the wavelength of CIV at the redshift of the quasar. We 
use a rest wavelength of 1550 A for CIV (close to the center 
of CIV doublet). If our interpretation of the contamination 
from nearby quasars from the previous subsection is correct, 
we would expect to see strong contamination from the CIV 
line for low Aflb er and then no signal (\qciv for large Aflber 
(because there should be no strong intergalactic CIV line 
emission). 




0 10 20 30 40 

r (h 1 Vlpc) 


Figure 20. A null test using Carbon IV-quasar cross¬ 
correlations. Here we show £ q civ measured from BOSS data av¬ 
eraged between rii = ±50// —1 Mpc and plotted as function of r±. 
We show results for 2 different values of Aflb er as indicated in 
each panel. The points with error bars show the observational 
results. To aid comparison with previous plots, the prediction of 
the theoretical model from Equation 11 for the quasar-Lya cross 
correlation is shown as a dotted line. A dashed line indicates 
scattered light contamination of the sort discussed in Section 3.2 
which is independent of r±. 

As with Figure [T9] we plot (( qa )(ri), in Figure [20l We 
show results for Aflber = 1 and for Aflber > 6. In the case 
of Aflber = 1 there is strong contamination from the quasar, 
which does not depend on r±. The amplitude of the contam¬ 
ination is approximately three times smaller than that from 
Lya , which is the ratio one would expect from the relative 
strengths of the two lines in Figure 1161 For the Aflber > 6 
we see that there is no evidence for contamination, and, al¬ 
though the error bars are relatively large, there is no sign 
of any £qCiv signal. We have plotted the fiducial £ qa model 
on both panels with a dotted line. That the contamination 
behaves the same way as the Lya results in the previous sub¬ 
section but that there is no £qciv signal is further evidence 
for the reality of the £ qa measurement. 

A.4 Tests with stars 

A further test of stray light contamination which we can 
carry out is to cross-correlate the positions of stars with the 
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is consistent with zero, again a sign that the cut in Aflb er 
removes scattered light contamination. 

The results from the tests in Appendix A therefore sug¬ 
gest that although light contamination from quasars can be 
readily seen in the data, it can be eliminated to a high de¬ 
gree by excluding close pairs of fibers. When this is done, 
the resulting measured quasar-Lya surface brightness corre¬ 
lation seems to be real. If instead it is produced by some as 
yet unknown systematic this effect would have to reproduce 
the r± and ry dependence expected from a cosmological sig¬ 
nal, be measurable only in the Lya wavelength range and 
not around other emission lines, and be independent of the 
position of the spectra on the detector. 

Having searched for and as far as we can tell eliminated 
the effects of strong light contamination, we now turn to 
other potential systematic effects in the next 2 appendices. 


APPENDIX B: LARGE-SCALE SURFACE 
BRIGHTNESS CORRECTION 

A potential systematic error on our measurement is any ob¬ 
scuration (such as that produced by Galactic dust) that 
may affect both quasar selection and Lya surface bright¬ 
ness, thus producing a spurious cross-correlation between 
the two on the plane of the sky. One can correct for such 
a cross-correlation by searching for an £ qa (rr,ri|) signal 
which is non-zero for large values of r, where no physical 
(3D) correlation would be expected. This correlation which 
will be a function of rx only (in the parallel line-of-sight 
approximation) can then be subtracted from the £ qct sig¬ 
nal. We have done this, computing the surface brightness 
correction fj, a given by 


Figure 21. A null test using Ha - star cross-correlations. Here 
we show £ s h q measured from BOSS data averaged between ry = 
±50/r 1 Mpc and plotted as function of r±. We show results for 
2 different values of Aflber as indicated in each panel. The points 
with error bars show the observational results. To aid comparison 
with previous plots, the prediction of the theoretical model from 
Equation 11 for the quasar-Lya cross correlation is shown as a 
dotted line. A dashed line indicates scattered light contamination 
of the sort discussed in Section 3.2 which is independent of r±. 


galaxy pixels. We use all stars in the SDSS DR10 catalog 
(Ahn et al. 2014), which is 171612 objects in total. In this 
case we choose the Ha line, with rest wavelength 6562.8 
A and cross-correlate star positions with galaxy pixels cen¬ 
tered on this wavelength. We measure the resulting star-Ha 
cross-correlation, (£ s h a)(r± is for two values of Aflber, 1 and 
> 6. In this test, all the stars are obviously at the same rest 
wavelength, which effectively means that in this measure¬ 
ment we are stacking galaxy spectra together. This test is 
therefore likely to be more sensitive to the CIV test to resid¬ 
ual artifacts in photocalibration of the spectrometer (see e.g. 
Figure 4 of Busca et al. 2013). Nevertheless it is useful to 
see whether any contamination has an r± dependence. 

We plot the results for (£ S Ha)(rx in Figure [2ll where 
we can see that there is again a sign of stellar light contami¬ 
nation when Aflber = 1 and that this contamination appears 
to be independent of r±. For Aflber > 6, the measurement 


/i a (ri) 


1 

2 (X a ~ X b ) 



£ q «(rj|,rx 



?qa (ry,rx)dry 


( 32 ) 

where x a = 400 h J Mpc and Xb = 80 h 1 Mpc and £ qQ 
(r±. ry) is the qso-Lya emission cross-correlation. The value 
of g a (n) is shown in the bottom panel of figure l22l We can 
see that there is no coherent structure, as fi a {r±) becomes 
both positive and negative as r± varies. We have also plotted 
the best fit linear CDM model (as a function of r) for the 
qso-Lya cross-correlation (from Section 13.211 . and we can 
see that this dominates over the fi a (r±) correction on small 
scales r± < 40/i _1 Mpc, as we would hope. 

There is also another analogous correction, but one that 
applies in the orthogonal direction. Any systematic effect 
which affects the Lya emission in the line-of-sight direction 
(including redshift evolution in the Lya surface brightness), 
and which is correlated with evolution in the quasar popu¬ 
lation with redshift (or at least evolution in the efficiency of 
quasar selection with redshift) could produce a spurious qso- 
Lya emission cross-correlation. We compute how £ qct varies 
as a function of ry for large r± values where there should 
be minimal physical clustering. In this case the the surface 
brightness correction [i a is given by 


Mm) = 


(*2 - x b ) 


rx 

J Xu 


(rx, r\\)r±drx (33) 


where we use the same values of x a and Xb as in Equation 
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Figure 22. Large scale residual flux ( Equations I32l and l33l Neg¬ 
ative vales are shown in blue and positive values in red. 

cm The results for p, Q (r||) are shown in the top panel of 
Figure [22] We can see that in this case there is a signifi¬ 
cant trend in the surface brightness correction with [i a (r\\) 
gradually decreasing from positive values of ~ 4 x 10 -22 erg 
s -1 cm -2 A' 1 arcsec -2 for ry = — 150h -1 Mpc to ~ — 10 -23 
erg s -1 cm -2 A -1 arcsec -2 . for ry = + 150/i -1 Mpc. As with 
the previous (rj_) correction, the CDM model dominates on 
scales ri < 40/i -1 Mpc, where most of the clustering signal 
is located. 

In our analyses in the main body of the paper, we have 
applied these Ata(ry) and /Uc«(rx) corrections to the compu¬ 
tation of the qso-Lya emission cross-correlation £ qa . We 
have done this on a quasar-pixel pair basis, computing ry 
and rx for each pair, and then subtracting the appropri¬ 
ate value of fi a . In all cases, the effect of the correction is 
small, as we would expect given the much large amplitude 
of the clustering signal compared to the surface brightness 
corrections that can be seen in Figure 1221 For example, in 
our fiducial case, without applying the large-scale surface- 
brightness corrections, we find a best fitting values of the 
shape and amplitude parameters of Section 3.2 of bqb a fp(fi) 
= 3.5 erg s -1 cm -2 A -1 arcsec -2 . and fl m = 0.26, which 
represent differences of 0.5cr and 0.5<r in the parameters re¬ 
spectively. 


APPENDIX C: SAMPLE TESTS 

In this appendix we report on three consistency tests of 
our cross-correlation results. One test uses sky fibers to 
search for Lya emission instead of galaxy fibers, and the 
second looks at how the luminosity of the orginally targeted 



Figure 23. Fit parameters for the CDM model fit (as in Figure 
0 to the quasar-Lyct cross-correlation function (Equation0 using 
sky fibers. The point shows the best fit values of the amplitude 
(bqb a fp(/j.) ) and shape (O m ) and the contours show the 1, 2 and 
3rr confidence intervals. The open circle shows the best fit values 
of the fit parameters for the fiducial sample. 



Figure 24. The quasar-Lya cross correlation function £ qa (r) 
(Equation [lj computed using sky fibers instead of galaxy- 
subtracted galaxy fiber. The solid line shows a lit to the CDM 
model and the dash-dotted line shows the fit from our fiducial 
sample (Figure 0, which used galaxy-subtracted galaxy fibers. 

galaxy affects the quasar-Lya cross-correlation. The third 
test investigates how quasar luminosity affects the cross¬ 
correlation. 

C.l Sky fibers 

Ideally one would like to be able to use fibers positioned 
on random areas of the sky to carry out intensity mapping, 
without needing to worry about subtracting a foreground 
galaxy. This approach is being carried out for example by 
HETDEX (Blanc et al. 2011). In our case, however, there 
are a number of such random fibers which were obtained 
for each plate, to use in sky-subtraction. The number of 


© 0000 RAS, MNRAS 000, 000-000 




















































































30 R.A.C. Croft et al. 


fibers available for use to use is 146065, approximately 15% 
as many as there are galaxy fibers in our fiducial dataset. 
We have taken these sky fibers and carried out the quasar- 
Lya cross-correlation of Equation [I] The calculation was 
the same as in the fiducial case, including subtraction of a 
background from fiber (Figure [T]). 

We show the results for £ q a (r) for the sky fibers in Fig¬ 
ure m We can see that the datapoints appear to be consis¬ 
tent with the trend delineated by the best fit to the fiducial 
computation (also shown, as a dash-dotted line), although 
the measurement is much noisier. We fit the same CDM lin¬ 
ear correlation function as was carried out in Section cm 
and show the best fit parameters and confidence contours 
in Figure ESI We find b q b a fp(fj,) = 2.2+J;q x 10 -20 erg s -1 
cm -2 A' 1 arcsec -2 and Q m = 0.52 !q 3®. Here we can see 
that there that the confidence contours are very wide, and 
that the clustering signal is present at only the 2<r confidence 
level. The results are also consistent with the results from 
the fiducial sample at the 1.5 a level. At present therefore, 
this shows that the use of galaxy and sky fibers in computing 
the cross-correlation is approximately equivalent, although 
the uncertainty is obviously large. 

In the future, it would arguably be best to use ran¬ 
domly distributed fibers to carry out the cross-correlation, 
to avoid any potential selection biases. In the current sample 
case these biases are below the level of detectability, but one 
can imagine two types of bias, related to the fact that galaxy 
fibers are selected in regions of above average sky brightness 
and sky fibers selected to be in regions of below average 
sky brightness. Both effects could in principle bias the mea¬ 
surement of Lya surface brightness in opposite directions. 
We now carry out a further test to constrain this effect, by 
splitting the spectra into two halves bases on target LRG 
luminosity. 

C.2 Sample split by galaxy luminosity 

We divide the galaxy spectrum sample into two halves, based 
on the measured SDSS r band luminosity (no k-correction 
was applied). One half consists of galaxies above the median 
luminosity of the whole dataset (5.2 x 10 40 erg s -1 ), and the 
other half those below it. The median r band luminosities of 
the halves are 6.9 x 10 40 erg s -1 and 3.8 x 10 40 erg s -1 

We note that the LRG galaxy surface brightnesses mea¬ 
sured from BOSS spectra are of order 10“ 17 erg s -1 cm -2 
A -1 arcsec -2 , approximately 2 — 3 orders of magnitude 
brighter than the mean Lya surface brightness {^ a ) that 
we have measured in Section [o] We therefore expect that 
it is unlikely that regions of excess background Lya surface 
brightness could have caused certain LRGs to be preferen¬ 
tially selected and therefore bias our measurements. 

We measure the quasar-Lya cross correlation ( qa (r) for 
the two samples of spectra and show the results in Figure 
ESI Both subsamples show evidence of clustering that is con¬ 
sistent with the CDM model shape and the fiducial ampli¬ 
tude. This can be seen in a quantitative manner from Figure 
1261 where we show the fit parameters. We find b q b a fp{fi) 
=3.lip ? x 10 -20 erg s -1 cm -2 A -1 arcsec -2 , for the bright 
galaxy spectra and b q b a fp(R) =(3.8 ± 0.5) x 10 -20 erg s -1 
cm -2 A -1 arcsec -2 , for the faint galaxy spectra. Both sam¬ 
ples are consistent with the amplitude of the fiducial result 
at the la level (the fiducial sample fractional error bar is 
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Figure 25. Quasar-Lya cross-correlation £ qa (r) for two sub¬ 
samples of spectra originally targeted at LRGs with (a) luminos¬ 
ity above the median value and (b) below the median value. The 
solid curve shows the best fit CDM model and the dash-dotted 
line the CDM fit to the fiducial sample (all LRGs). 

bar + 42 S for fixed Q m = 0.30). This indicates that any bias 
present in the mean surface brightness ( fj , a ) due to fiber tar¬ 
get selection is likely to be at the ~ 10% level or less. 

C.3 Sample split by quasar luminosity 

A potential source of Lya emission clustered with quasars 
is recombination radiation from dense IGM systems illumi¬ 
nated by the quasars themselves (e.g., Kollmeier et al. 2010). 
We have made an estimate of the amplitude of this signal in 
Section [6] and find it likely that it is much smaller than the 
signal from star forming galaxies clustered with the quasar. 
One way of testing this directly is by splitting the sample 
into high and low luminosity quasars and then measuring 
the £ qa (r) signal for each. We note that this test can also 
function as a diagnostic for stray light contamination from 
quasars, as any stray light should be correlated with quasar 
luminosity. 

In this section we do this, measuring £ qa (r) the sub¬ 
sample of quasars with SDSS r band luminosity above the 
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Figure 26. CDM amplitude and shape parameters fitted to 
the Quasar-Lya cross-correlation £ qQ: (r) for two subsamples of 
spectra originally targeted at LRGs with (a) luminosity above the 
median value and (b) below the median value. The points shows 
the best fit values of the amplitude (bqb a fp(ii} ) and shape (Q m ) 
and the contours show the 1,2 and 3cr confidence intervals. The 
open circles show the best fit values of the fit parameters for the 
fiducial sample. 
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Figure 27. Quasar-Lya cross-correlation £ qa (r) for two subsam¬ 
ples of quasar data with (a) luminosity above the median value 
and (b) below the median value. The solid curve shows the best fit 
CDM model and the dash-dotted line the CDM fit to the fiducial 
sample (all quasars). 



median, and with the subsample with luminosities below the 
median. The median luminosity of the bright subsample is 
3.45 times the median luminosity of the faint sample, and 
the median redshifts of the quasars in each are z = 2.66 and 
2 = 2.44 respectively. 

We can see in Figures [271 that the £ q a (r) results for 
two subsamples do not look very different. In particular if 
the quasar luminosities were causing significant fluorescent 
emission from nearby IGM material one might expect the 
brighter subsample to exhibit a steeper £ qQ! (r) on small 
scales, which is not the case. From Figure[28]we can see that 
the fit parameters for the bright subsample are within 1 <t of 
the fiducial result and the faint subsample within 2er. The 
amplitude of parameter (b q b a fp(ii) ) (for Planck = 0.30) 
is actually slightly lower for the bright subsample, being 
(2.3 ±0.6) x IO -20 erg s -1 cm -2 A -1 arcsec -2 , whereas for 
the faint subsample it is 3.9j)Q g x IO -20 erg s -1 cm -2 A -1 
arcsec -1 . Given that the quasar subsamples are different in 
luminosities by a factor of 3.45 one would expect there to 
be a significant difference the £ q a (r) results in the oppo¬ 


site direction if quasar properties were significantly affect¬ 
ing the large-scale Lya intensity around them. As mentioned 
above, this lack of dependence on quasar luminosity is also 
strong evidence that stray light from quasars scattered in 
the spectrograph is not contaminating our measured cross¬ 
correlation signal. 


C.4 Other: sample split by fiber position in 
spectrograph 

It is possible to imagine other sample tests based on vari¬ 
ous observational parameters, which could help to identify 
sources of systematic error. One such test is to split the 
sample into two subsets based on position of fibers in the 
spectrograph. Of the 1000 fibers, those labelled with num¬ 
bers 100-400 and 600-900 are positioned towards the center 
of the spectrograph and the rest are positioned more towards 
the edge. The reason for such a test is that the optical qual¬ 
ity of the instrument cameras degrades towards the edge, 
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Figure 28. CDM amplitude and shape parameters fitted to 
the Quasar-Lya cross-correlation £ qa (r) for two subsamples of 
quasars with (a) luminosity above the median value and (b) below 
the median value. The points shows the best fit values of the 
amplitude ( b q b a fp(fi ) ) and shape (f2 m ) and the contours show 
the 1,2 and 3 a confidence intervals. The open circles show the 
best fit values of the fit parameters for the fiducial sample. 

and this might affect the signal to noise ratio and/or cause 
other problems. 

We carry out this split, and find that the average 
residual flux after subtraction of the model LRG spectra 
is somewhat different for the two subsamples. The resid¬ 
ual flux for the fiducial sample was plotted in Figure [D 
where we saw that it fluctuated between approximately 
±3 x 10 20 erg s -1 cm -2 A arcsec 2 for most of the rele¬ 
vant redshift range. In Figure [25] we plot for each subsam¬ 
ple (edge and center) A(SB res id U ai), the residual flux for the 
subsample minus the residual flux for the fiducial sample. 
We can see that there are indeed differences- the residual 
flux in fibers close to the edge of the cameras is system¬ 
atically higher than the fiducial sample (and the center of 
camera sample) by ~ 10 -20 erg s -1 cm -2 A arcsec -2 over 
the wavelength range ~ 4000 — 5000 A. As this covers the 
important Lya redshift range 2 = 2.3—3.0, this could indeed 
have consequences for our measurements. 

Using the appropriate mean residual flux for each 
subsample, we compute the quasar-Lya emission cross¬ 
correlation ( qa . It should be noted that the subsamples are 


x (A) 



Figure 29. The difference in average residual flux for two sub¬ 
samples of LRG spectra selected to be close to the center of of 
the spectrograph camera (blue line) or edge of spectrograph cam¬ 
era (red line). The residual flux is that left after subtraction of 
the best fit galaxy model from each spectrum, and the difference 
plotted here is the difference between each subsample and the 
fiducial result plotted as a black line in Figure [T] 

not quite the same size (there are 400 “edge” fibers per plate 
vs 600 “center” fibers). We fit the usual CDM model pa¬ 
rameters and find bqb a fp{fj) =3.9+0'! x 10 -20 erg s -1 cm -2 
A -1 arcsec -2 , and Q m = 0.31!q'Jq for the central fibers and 
bqbafpili) =(1.5±0.7) x 10 -20 erg s -1 cm -2 A -1 arcsec -2 , 
and = 0.44 )Zq' 33 for the edge libers. The central fiber 
results are within 1 u of the fiducial results, but the edge 
fiber parameters have larger errors and are 2.5 a below the 
fiducial results. We have also tried averaging the £ qQ , results 
from the two subsamples before fitting and find fit parame¬ 
ters consistent at the lcr level with the fiducial results. 

This test appears to show that the degradation of the 
optical quality of the spectrograph cameras close to the edge 
of their field of view does affect our ability to measure £ qc « 

. Given this degradation, it is reasonable to expect that the 
error bars for the edge sample are underestimated and that 
this may account for the 2.5 a difference with the fiducial 
sample. The fibers close to the center of camera sample ap¬ 
pears to be responsible for most of our ability to determine 
( qo . This information should be useful for future measure¬ 
ments of this type, but we do not use it to effect any changes 
to the fiducial analysis in this paper. 
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